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Summary of Changes 



The following summarizes the changes made to the former (September 1988) 
edition of this manual. 

New Assembler Functions. Added truncate and round, and 
conditional trap assembler functions to table on pp. 6-3 and 6-4. 

New Pseudo-Ops. The ,aent name, .alias, and .noalias have 
been added to the description of pseudo-ops on pp. 8-1 and 8-5. 

New Constant. A description of the constant sclnit was added 
to the table on p. 10-17. 

Reorganization Constraints and Rules. Added a cross- 
reference to the sections of the MIPS Architecutre book that de- 
scribe the constraints and rules for using load, jump, and branch 
instructions. 

General. Numerous minor technical and editorial corrections 
have been made through the manual. 
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About This Book 



This book describes the assembly language supported by the RISCompiler sys- 
tem, its syntax rules, and how to write some assembly programs. For information 
about assembling and linking a program written in assembly language, see the 
Languages Programmer's Guide. 

The assembler converts assembly language statements into machine code. In 
most assembly languages, each instruction corresponds to a single machine in- 
struction; however, some assembly language instructions can generate several 
machine instructions. This feature results in assembly programs that can run 
without modification on future machines, which might have different machine 
instructions. See Appendix B for more information about assembler instructions 
that generate multiple machine instructions. 

Who Should Read This Book? 

This book assumes that you are an experienced assembly language programmer. 

The assembler exists primarily to produce object modules from the assembly in- 
structions that the C, Fortran 77, and Pascal compilers generate. It therefore 
lacks many functions normally present in assemblers. Therefore, we recommend 
that you use the assembler only when you need to: 

• Maximize the efficiency of a routine, which might not be possi- 
ble in C, Fortran 77, Pascal, or another high-level language— for 
example, to write low-level I/O drivers. 

• Access machine functions unavailable from high-level lan- 
guages or satisfy special constraints such as restricted register 
usage. 

• Change the operating system. 

• Change the compiler system. 

What Does This Book Cover? 

This book has these chapters: 

Chapter 1 —Registers describes the format for the general registers, the 
special registers, and the floating point registers. 

Chapter 2 — Addressing describes how addressing works. 

Chapter 3— Exceptions describes exceptions you might encounter with as- 
sembly programs. 

Chapter 4 — Lexical Conventions describes the lexical conventions that the 
assembler follows. 
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Chapter 5 — Instruction Set describes the main processor's instruction set, 
including notation, load and store instructions, computational instructions, 
and jump and branch instructions. 

Chapter 6— Coprocessor Instruction Set describes the coprocessor instruc- 
tion sets. 

Chapter 7— Linkage Conventions describes linkage conventions for all 
supported high-level languages. It also discusses memory allocation and 
register use. 

Chapter 8— Pseudo-Op-Codes describes the assembler's pseudo-opera- 
tions (directives). 

Chapter 9— Object File Format provides an overview of the components 
comprising the object file and describes the headers and sections of the ob- 
ject file. 

Chapter 10— Symbol Table describes the purpose of the Symbol Table and 
the format of entries in the table. This chapter also lists the symbol table rou- 
tines that are supplied. 

Appendix A— Instruction Summaries summarizes all assembler instruc- 
tions. 

Appendix B— Basic Machine Definition describes instructions that gener- f 

ate more than one machine instruction. V 

For More Information 

As you use this manual, consult the following book: 

• Languages Programmer's Guide (Order number 3200DOC) 

• MIPS RISC Architecture (Order number 31111 DOC) 
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Registers 



This chapter discusses the registers and describes how memory organization af- 
fects them. Refer to Chapter 6 for information regarding register use and link- 
age. 

The machine has these registers: 

• General registers, which are always one word wide 

• Coprocessor registers (for example, floating point registers) 

• Two special registers that hold the results of multiplication and 
division instructions 

You must use general registers where the assembly instructions expect general 
registers and floating point registers where the assembly instructions expect 
floating point registers. If you confuse the two, the assembler issues an error 
message. 



Register Format 



A machine's byte ordering scheme (or endian issues) affects memory organiza- 
tion and defines the relationship between address and byte position of data in 
memory. R2000 machines can be big-endian or little-endian. Big-endian ma- 
chines store the sign bit in the lowest address byte. Little-endian machines store 
the sign bit in the highest address byte. Before you use the assembler, determine 
whether you have a big-endian or a little-endian machine. (There are no assem- 
bler instructions that depend on bit numbering; it is a software convention.) 



Big-Endian Machines 



Big-endian machines number the bytes of a word from to 3. Byte holds the 
sign and most significant bits. 

For halfwords, big-endian machines number the bytes from to 1. Byte holds 
the sign and most significant bits. 

Big-endian machines number the bits of each byte from to 7, using this format: 



• Bit holds the least significant bit. 

• Bit 7 holds the most significant bit. 
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Bit: 31 .... 24 



23 .... 16 15 ... 8 
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byte 3 
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/ 




least significant 
bit 
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Figure LI Big-endian byte ordering. 



Llttle-Endian Machines 



Little-endian machines number the bytes of a word from 3 to 0. Byte 3 holds the 
sign and most significant bits. 

For halfwords, little-endian machines number the bytes from 1 to 0. Byte 1 
holds the sign and most significant bits. 

Little-endian machines number the bits of each byte from 7 to ? using this for- 
mat: 

• Bit holds the least significant bit. 

• Bit 7 holds the most significant bit. 
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Bit: 



31 .... 24 



23 .... 16 15 ... 8 
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Figure 1.2 Little-endian byte ordering. 



General Registers 



Each general register has 32 bits. The assembler reserves all register names, and 
you must use lowercase for the names. All register names start with a dollar sign 
($). 

The general registers have the names $0..$31 . By including the file regdef.h (use 
#include /usr/include/regdef.h) in your program, you can use software names for 
some general registers. The operating system and the assembler use the general 
registers $1, $26, $27, $28, and $29 for specific purposes. (NOTE: Attempts to 
use these general registers in other ways can produce unexpected results.) If a 
program uses the names $1, $26, $27, $28, $29 rather than the names $at, $ktO, 
$ktl, $gp, $sp respectively, the assembler issues warning messages. 

General register $0 always contains the value 0. All other general registers are 
equivalent, except that general register $31 also serves as the implicit link regis- 
ter for jump and link instructions. See Chapter 6 for a description of register 
assignments. 
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Table 1 J General Registers 



Register Name 


Software Name 
(from regdef.h) 


Use and Linkage 


$0 




always has the value 


$at 




reserved for the assembler 


$2..$3 


vO-vl 


used for expression evaluations and to hold 
the integer type function results. Also used 
to pass the static link when calling nested 
procedures. 


$4..$7 


a0-a3 


used to pass the first 4 words of integer 
type actual arguments, their values are 
not preserved across procedure calls 


$8,.$15 


tO-t7 


temporary registers used for expression evalu- 
ations; their values aren't preserved across 
procedure calls. 


$16..$23 


sQ~s7 


saved registers. Their values must be pre- 
served across procedure calls. 


$24,.$25 


t8-t9 


temporary registers used for expression evalu- 
ations; their values aren't preserved across 
procedure calls. 


$kt0..$ktl 


kO-kl 


reserved for the operating system kernel 


$28or$gp 


gP 


contains the global pointer 


$29 or $sp 


sp 


contains the stack pointer 


$30or$fp 


* 


contains the frame pointer (if needed); 
otherwise a saved register (like s0-s7) 


$31 


ra 


contains the return address and used 
for expression evaluation 
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Special Registers 



The machine has two 32 bit special registers. The hi and lo special registers hold 
the results of the multiplication (mult and multu) and division (div and divu) 
instructions. 

You usually do not need to refer explicitly to these special registers. Instructions 
that use the special registers refer to them automatically. 

Table 1.2 Special Registers 



Name 


Description 


hi 
lo 


Multiply/Divide special register holds the most significant 
32 bits of multiply, remainder of divide 
Multiply/Divide special register holds the least significant 
32 bits of multiply, quotient of divide 



Floating Point Registers 



Floating point registers provide for single-precision and double precision float- 
ing point operations as described below. Chapter 6 describes floating point reg- 
ister use. 



Single Precision Floating Point Registers 



Single precision floating point values (32-bits) require one floating point regis- 
ter. Single precision floating point instructions must refer to the even-numbered 
floating point registers. 



Double Precision Floating Point Registers 



Double precision floating point values (64 bits) require two floating point regis- 
ters. Double precision floating point instructions must refer to the even-num- 
bered floating point registers. 
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Figure J. 3 Floating Point Register Set 
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This chapter describes the formats that you can use to specify addresses. The 
machine uses a byte addressing scheme. Access to halfwords requires alignment 
on even byte boundaries, and access to words requires alignment on byte bounda- 
ries that are divisible by four. Any attempt to address a data item that does not 
have the proper alignment causes an alignment exception. 

The unaligned assembler load and store instructions may generate multiple ma- 
chine language instructions. They do not raise alignment exceptions. 

These instructions load and store unaligned data: 

• load word left (lwl) 

• load word right (Iwr) 

• store word left (swl) 

• store word right (swr) 

• unaligned load word (ulw) 

• unaligned load halfword (ulh) 

• unaligned load halfword unsigned (ulhu) 

• unaligned store word (usw) 

• unaligned store halfword (ush) 
These instructions load and store aligned data: 

• load word (lw) 

• load halfword (lh) 

• load halfword unsigned (lhu) 

• load byte (lb) 

• load byte unsigned (lbu) 

• store word (sw) 

• store halfword (sh) 

• store byte (sb) 
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Address Formats 

The assembler accepts these formats for addresses: 
Table 2 J Address Formats 



Format 


Address 


(base register) 


base address (zero 
offset assumed) 


expression 


absolute address 


expression (base register) 


based address 


relocatable-symbol 


relocatable 
address 


relocatable-symbol + expression 


relocatable address 


relocatable-symbol ± expression (index register) 


indexed relocatable 
address 



Address Descriptions 



The assembler accepts any combination of the constants and operations described 
in Chapter 2 for expressions in address descriptions. 





Table 2.2 Assembler Addresses 


Expression 


Address Description 


( base-register ) 


Specifies an indexed address, which assumes a zero off- 
set The base-register's contents specify the address. 


expression 


Specifies m absolute address. The assembler generates 
the most locally efficient code for referencing a value at 
the specified address. 


expression ( base-register ) 


Specifies a based address. To get the address, the machine 
adds the value of the expression to the contents of the 
base-register. 


relocatable-symbol 


Specifies a relocatable address. The assembler generates 
the necessary instruction(s) to address the item and gener- 
ates relocatable information for the link editor. 



c 



c 
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Table 2.2 Assembler Addresses (continued) 



Expression Address Description 



relocatable-symbol expression Specifies a relocatable address. To get the address, the as- 

sembler adds or subtracts the value of the expression, 
which has an absolute value, from the relocatable symbol. 
The assembler generates the necessary instruction(s) to ad- 
dress the item and generates relocatable information for the 
link editor. If the symbol name does not appear as a label 
anywhere in the assembly, the assembler assumes that the 
symbol is external. 

relocatable-symbol ( base-register ) Specifies an indexed relocatable address. To get the 

address, the machine adds the index-register to the 
relocatable symbol's address. The assembler generates 
the necessary instruction(s) to address the item and gen- 
erates relocatable information for the link editor. If the 
symbol name does not appear as a label anywhere in the 
assembly, the assembler assumes that the symbol is ex- 
ternal. 

( base-register ) Specifies an indexed relocatable address. To get the address, the 

assembler adds or subtracts the relocatable symbol, the expres- 
sion, and the contents of the index-register. The assembler gener- 
ates the necessary instruction(s) to address the item and generates 
relocation information for the link editor. If the symbol does not 
appear as a label anywhere in the assembly, the assembler as- 
sumes that the symbol is external. 
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This chapter describes the exceptions that you can encounter while running as- 
sembly programs. The machine detects some exceptions directly, and the assem- 
bler inserts specific tests that signal other exceptions. This chapter lists only 
those exceptions that occur most frequently. 

Main Processor Exceptions 

For the assembly language programmer, these are the most common main proc- 
essor exceptions: 

• Address error exceptions, which occur when the machine refer- 
ences a data item that is not on its proper memory alignment or 
when an address is invalid for the executing process 

• Overflow exceptions, which occur when arithmetic operations 
compute signed values and the destination lacks the precision to 
store the result 

• Bus exceptions, which occur when an address is invalid for the 
executing process 

• Divide-by-zero exceptions, which occur when a divisor is zero 

Floating Point Exceptions 

These are the floating point exceptions: 

• Invalid operation exceptions 

o magnitude subtraction of infinities, for example: 
+ — 1 

o multiplication of by 1 with any signs 

o division of 0/0 or 1/1 with any signs 

o conversion of a binary floating-point number to 
an integer format when an overflow or the oper- 
and value for the infinity or NaN precludes a 
faithful representation in the format (see Chap- 
ter 4) 

o comparison of predicates that have unordered 
operands, and that involve Greater Than or Less 
Than without Unordered. 
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o any operation on a signaling NaN ^ 

Divide-by-zero exceptions ^ 

Overflow exceptions— these occur when a rounded floating 
point result exceeds the destination format's largest finite num- 
ber 

Underflow exceptions — these occur when a result has lost accu- 
racy and also when a nonzero result is between 2 Emin (plus or mi- 
nus 2 to the minimum expressible exponent). 

Inexact exceptions 
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Tokens 



Comments 



This chapter discusses lexical conventions for these topics: 

• tokens 

• comments 

• identifiers 

• constants 

• multiple lines per physical line 

• sections and location counters 

• statements 

• expressions 

This chapter uses the following notation to describe syntax: 

• I (vertical bar) means "or" 

• [ ] (square brackets) enclose options 

• ± indicates both addition and subtraction operations 

The assembler has these tokens: 

• identifiers 

• constants 

• operators 

The assembler lets you put blank characters and tab characters anywhere between 
tokens; however, it does not allow these characters within tokens (except for 
character constants). A blank or tab must separate adjacent identifiers or con- 
stants that are not otherwise separated. 



The pound sign character (#) introduces a comment. Comments that start with a 
# extend through the end of the line on which they appear. You can also use O 
language notation /*...*/ to delimit comments. 

The assembler uses cpp (the C language preprocessor) to preprocess assembler 
code. Because cpp interprets #s in the first column as pragmas (compiler direc- 
tives), do not start a # comment in the first column. 
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Identifiers 



Constants 



Scalar Constants 



An identifier consists of a case-sensitive sequence of alphanumeric characters, 
including these: 

• . (period) 

• _ (underscore) 

• $ (dollar sign) 

Identifiers can be up to 31 characters long, and the first character cannot be nu- 
meric. 

If an identifier is not defined to the assembler (only referenced), the assembler 
assumes that the identifier is an external symbol The assembler treats the identi- 
fier like a *globI pseudo-operation (see Chapter 8). If the identifier is defined to 
the assembler and the identifier has not been specified as global, the assembler 
assumes that the identifier is a local symbol. 

The assembler has these constants: 

• scalar constants 

• floating point constants 

• string constants 



The assembler interprets all scalar constants as twos complement numbers. Sca- 
lar constants can be any of the digits 0123456789abcdefABCDEF. 

Scalar constants can be one of these constants: 

• decimal constants, which consist of a sequence of decimal digits 
without a leading zero 

• hexadecimal constants, which consist of the characters Ox (or OX 
) followed by a sequence of digits 

• octal constants, which consist of a leading zero followed by a 
sequence of digits in the range 0..7 



Floating Point Constants 



Floating point constants can appear only in float and .double pseudo-operations 
(directives)— see Chapter 8 — and in the floating point Load Immediate instruc- 
tions—see Chapter 6. Floating point constants have this format: 



±dl[.d2][elE±d3] 



Where: 



dl is written as a decimal integer and denotes the integral part of 
the floating point value 



c 



c 



c 
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String Constants 



• 



• 



d2 is written as a decimal integer and denotes the fractional part 
of the floating point value 



<B is written as a decimal integer and denotes a power of 10 

• the "+" symbol is optional 

For example: 
21/73E-3 

represents the number .02173. 

float and .double directives may optionally use hexadecimal floating point con- 
stants instead of decimal ones. A hexadecimal floating point constant consists 
of: 

<+ or -> Ox <1 or or nothing> . <hex digits> H Ox <hex digits> 

The assembler places the first set of hex digits (excluding the or 1 preceding 
the decimal point) in the mantissa field of the floating point format without at- 
tempting to normalize it. It stores the second set of hex digits into the exponent 
field without biasing them. It checks that the exponent is appropriate if the man- 
tissa appears to be denormalized. Hexadecimal floating point constants are use- 
ful for generating DEEE special symbols, and for writing hardware diagnostics. 

For example, either of the following generates a single-precision "1.0": 

.float 1.0e+0 
.float 0xl.0h0x7f 



String constants begin and end with double quotation marks ("). 

The assembler observes C language backslash conventions. For octal notation, 
the backslash conventions require three characters when the next character could 
be confused with the octal number. For hexadecimal notation, the backslash con- 
ventions require two characters when the next character could be confused with 
the hexadecimal number (i.e., use a for the first character of a single character 
hex number). 

The assembler follows the backslash conventions shown in Table 4.1 : 
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Table 4. J Backslash Conventions 



Convention 


Meaning 


\a 


alert (0x07) 


\b 


backspace (0x08) 


\f 


form feed (0x0c) 


An 


newline (0x0a) 


\r 
\t 


carriage return (OxOd) 


horizontal tab (0x09) 


\y 


vertical feed (0x0b) 


\\ 


backslash (0x5c) 


\" 


quotation mark (0x22) 


V 


single quote (0x27) 


\000 


character whose octal value is 000 


\Xnn 


character whose hexadecimal value is nn 



c 



Multiple Lines Per Physical Line 

You can include multiple statements on the same line by separating the state- 
ments with semicolons. The assembler does not recognize semicolons as separa- 
tors when they follow comment symbols (# or '/*). 

Sections and Location Counters 

Assembled code and data fall in one of six sections: 



( 



c 



*T"" I, " , *T 
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liisil 



Jf data 



O.U.'A'. 1 . 1 . 1 .'. 1 . 1 .'. 1 .". ' . 



liilii 



' i; *'r?*iri*^ i r^ 



llliiii 



♦sbss 



11111 



text section 



read-only data section 



data section 



small data section, addressed 
through register $gp 



small bss section, addressed 
through register $gp 



bss (block started by storage) 
section, which holds zero- 
initialized data 



Figure 4.1 Section and location counters 

(For more information on section data, see Chapter 9 of this manual.) 

The assembler always generates the text section before other sections. Additions 
to the text section happen in four-byte units. Each section has an implicit loca- 
tion counter, which begins at zero and increments by one for each byte assembled 
in the section. 

The bss section holds zero-initialized data. If a Xcomm pseudo-op defines a 
variable (see Chapter 8), the assembler assigns that variable to the bss (block 
started by storage) section or to the sbss (short block started by storage) section 
depending on the variable's size. The default variable size for sbss is 8 or fewer 
bytes. 

The command line option -G for each compiler (C, Pascal, Fortran 77, or the 
assembler), can increase the size of sbss to cover all but extremely large data 
items. The link editor issues an error message when the -G value gets too large. 
If a -G value is not specified to the compiler, 8 is the default. Items smaller than, 
or equal to, the specified size go in sbss. Items greater than the specified size go 
in bss. 

Because you can address items much more quickly through $gp than through a 
more general method, put as many items as possible in sdata or sbss. The size of 
sdata and sbss combined must not exceed 64K bytes. 
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Statements 



Label Definitions 



Null Statements 



Each statement consists of an optional label, an operation code, and the oper- 
and^). The machine allows these statements: 

• null statements 

• keyword statements 



A label definition consists of an identifier followed by a colon. Label definitions 
assign the current value and type of the location counter to the name. An error 
results when the name is already defined, the assigned value changes the label 
definition, or both conditions exists. 

Label definitions always end with a colon. You can put a label definition on a 
line by itself, 

A generated label is a single numeric value (1... 255). To reference a generated 
label, put an f (forward) or a b (backward) immediately after the digit. The refer- 
ence tells the assembler to look for the nearest generated label that corresponds to 
the number in the lexically forward or backward direction. 



A null statement is an empty statement that the assembler ignores. Null state- 
ments can have label definitions. For example, this line has three null statements 
in it: 



label : 



Keyword Statements 



Expressions 



A keyword statement begins with a predefined keyword. The syntax for the rest 
of the statement depends on the keyword. All instruction opcodes are keywords. 
All other key words are assembler pseudo-operations (directives), 



An expression is a sequence of symbols that represent a value. Each expression 
and its result have data types. The assembler does arithmetic in twos comple- 
ment integers with 32 bits of precision. Expressions follow precedence rules and 
consist of: 

• operators 

• identifiers 

• constants 

Also, you may use a single character string in place of an integer within an ex- 
pression. Thus: 



.byte "a" / ,word "a"+0xl< 



c 



c 



is equivalent to: 



c 
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Precedence 



.byte 0x61 ; .word 0x7a 



Unless parentheses enforce precedence, the assembler evaluates all operators of 
the same precedence strictly from left to right. Because parentheses also desig- 
nate index-registers, ambiguity can arise from parentheses in expressions. To 
resolve this ambiguity, put a unary + in front of parentheses in expressions. 

The assembler has three precedence levels, which are listed here from lowest to 
highest precedence: 



least binding, 
lowest precedence: 

most binding 
highest precedence: 



binary +, - 

binary *, /, %, «, », A , &, I 
unary -, +, ~ 



NOTE: The assembler's precedence scheme differs from that of the C language. 



Expression Operators 

For expressions, you can rely on the precedence rules, or you can group expres- 
sions with parentheses. The assembler has these operators: 

Table 4.2 Expression Operators 



Data Types 



Operator 


Meaning 


+ 


addition 


- 


subtraction 


* 


multiplication 


/ 


division 


% 


remainder 


« 


shift left 


» 


shift right (sign NOT extended) 


A 


bitwise EXCLUSIVE OR 


& 


bitwise AND 


1 


bitwise OR 


- 


minus (unary) 


+ 


identity (unary) 


~ 


complement 



The assembler manipulates several types of expressions. Each symbol you refer- 
ence or define belongs to one of these categories: 
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Table 43 Data Types 



Type 



Description 



undefined 



sundefined 



absolute 
text 

data 



sdata 



rdata 



bss and sbss 



Any symbol that is referenced but not defined becomes global undefined, and this 
module will attempt to import it. The assembler uses 32-bit addressing to access 
these symbols. (Declaring such a symbol in a .globl pseudo-op merely makes its 
status clearer). 

A symbol defined by a .extern pseudo-op becomes global small undefined if its 
size is greater than zero but less than the number of bytes specified by the -G op- 
tion on the command line (which defaults to 8). The linker places these symbols 
within a 64k byte region pointed to by the $gp register, so that the assembler can 
use economical 16-bit addressing to access them. 

A constant defined in an "=" expression. 

The text section contains the program's instructions, which are not modifiable 
during execution. Any symbol defined while the .text pseudo-op is in effect be- 
longs to the text section. 

The data section contains memory which the linker can initialize to nonzero values 
before your program begins to execute. Any symbol defined while the .data 
pseudo-op is in effect belongs to the data section. The assembler uses 32-bit ad- 
dressing to access these symbols. 

This category is similar to data, except that defining a symbol while the .sdata 
("small data") pseudo-op is in effect causes the linker to place it within a 64k byte 
region pointed to by the $gp register, so that the assembler can use economical 
16-bit addressing to access it. 

Any symbol defined while the .rdata pseudo-op is in effect belongs to this cate- 
gory, which is similar to data, but may not be modified during execution. 

The bss and sbss sections consist of memory which the kernel loader initializes 
to zero before your program begins to execute. Any symbol defined in a xomm 
or Jcomm pseudo-op belongs to these sections (except that a .data, .sdata, or 
.rdata pseudo-op can override a xomm directive). If its size is less than the 
number of bytes specified by the -G option on the command line (which defaults 
to 8), it belongs to sbss ("small bss"), and the linker places it within a 64k byte 
region pointed to by the $gp register so that the assembler can use economical 
16-bit addressing to access it. Otherwise, it belongs to bss and the assembler 
uses 32-bit addressing. 

Local symbols in bss or sbss defined by Jcomm are allocated memory by 
the assembler; global symbols are allocated memory by the link editor; and 
symbols defined by xomm are overlaid upon like-named symbols (in the 
fashion of Fortran "COMMON" blocks) by the link editor. 



c 
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Symbols in the undefined and small undefined categories are always global (that 
is, they are visible to the link editor and can be shared with other modules of your 
program). Symbols in the absolute, text, data, sdata, rdata, bss, and sbss catego- 
ries are local unless declared in a .globl pseudo-op. 



Type Propagation in Expressions 



When expression operators combine expression operands, the result's type de- 
pends on the types of the operands and on the operator. Expressions follow these 
type propagation rules: 

• If an operand is undefined, the result is undefined. 

• If both operands are absolute, the result is absolute. 

• If the operator is + and the first operand refers to a relocatable 
text-section, data-section, bss-section, or an undefined external, 
the result has the postulated type and the other operand must be 
absolute. 

• If the operator is - and the first operand refers to a relocatable 
text-section, data-section, or bss-section symbol, the second 
operand can be absolute (if it previously defined) and the result 
has the first operand's type; or the second operand can have the 
same type as the first operand and the result is absolute. If the 
first operand is external undefined, the second operand must be 
absolute. 

• The operators *,/,%,«,», % A , & , and | apply only to 
absolute symbols. 
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Instruction Set 

This chapter describes instruction notation and discusses assembler instructions 
for the main processor. Chapter 6 describes coprocessor notation and instruc- 
tions. 

Instruction Classes 

The assembler has these classes of instructions for the main processor: 

Load and Store Instructions These instructions load immedi- 
ate values and move data between memory and general registers. 

Computational Instructions These instructions do arithmetic 
and logical operations for values in registers. 

Jump and Branch Instructions These instructions change pro- 
gram control flow. 

Coprocessor Interface These instructions provide standard 
interfaces to the coprocessors. 

Special Instructions These instructions do miscellaneous tasks. 

Reorganization Constraints and Rules 

To maximize, performance the goal of RISC designs is to achieve an execution 
rate of one machine cycle per instruction. In writing assembly language instruc- 
tions, you must be aware of the rules and contrains to achieve this goal. This in- 
formation is given in the MIPS RISC Architecture book (order number 
311 1DOC— one is shipped with each RISC/os system). You should refer to the 
following sections in this book for more information: 



ipter 


Section Title 


1 
1 
1 
1 


Cycles/Instruction 
Instruction Pipelines 
Instruction Operation Time 
Instruction Access Time 


3 
3 
3 
C 


The Delayed Instruction Slot 
Delayed Loads 
Delayed Jumps and Branches 
Filling the Branch Delay Slot 



Refer also to Figlure 7.4 FPA Instruction Execution Times in Chapter 7 of the 
same book. 
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( 

The tables in this chapter list the assembler format for each load, store, computa- 
tional, jump, branch, coprocessor, and special instruction. The format consists of 
an op-code and a list of operand formats. The tables list groups of closely re- 
lated instructions; for those instructions, you can use any op-code with any 
specified operand. Operands can take any of these formats; 



• 



memory references— for example a relocatable symbol +/- an 
expression(register) 

• expressions (for immediate values) 

• two or three operands— for example, add $3,$4 is the same as 
add $3,$3,$4 

Load and Store Instructions 

The machine has general-purpose load and store instructions. 



c 



c 
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Load and Store Formats 

The operands in Table 5.1 have the following meanings: 



Operand 


Description 


destination 
address 
source 
expression 


the destination register 

a symbolic expression (see Chapter 2) 

the source register 

an absolute value 



Table 5 J Load and Store Formats 



Description 


Op-code 


Operands 


Load Address 


la 


destination, address 


Load Byte 


lb 




Load Byte Unsigned 


Ibu 




Load Halfword 


lh 




Load Halfword Unsigned 


lhu 




Load Word 


lw 




Load Word Left 


lwl 




Load Word Right 


lwr 




Load Double 


Id 




Unaligned Load Halfword 


ulh 




Unaligned Load Halfword Unsigned 


ulhu 




Unaligned Load Word 


ulw 




Load Immediate 


li 


destination, expression 


Load Upper Immediate 


lui 




Store Byte 


sb 


source, address 


Store Double 


sd 




Store Halfword 


sh 




Store Word Left 


swl 




Store Word Right 


swr 




Store Word 


sw 




Unaligned Store Halfword 


ush 




Unaligned Store Word 


usw 





Load Instruction Descriptions 



For all machine load instructions, the effective address is the 32-bit twos-com- 
plement sum of the contents of the index-register and the (sign-extended) 16-bit 
offset. Instructions that have symbolic labels imply an index-register, which the 
assembler determines. The assembler supports additional load instructions, 
which can produce multiple machine instructions. 

NOTE: Load instructions can generate many code sequences for which the link 
editor must fix the address by resolving external data items. 
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Table 52 Load Instruction Descriptions 



Instruction Name 


Description 


Load Address (la) 


Loads the destination register with the effective address of the 
specified data item. 


Load Byte (lb) 


Loads the least significant byte of the destination register with the 
contents of the byte that is at the memory location specified by the 
effective address. The machine treats the loaded byte as a signed 
value: bit seven is extended to fill the three most significant 
bytes. 


Load Byte Unsigned (Ibu) 


Loads the least significant byte of the destination register with the 
contents of the byte that is at the memory location specified by the 
effective address. Because the machine treats the loaded byte as 
an unsigned value, it fills the three most significant bytes of the 
destination register with zeros. 


Load Double (Id) 


Loads the register pair (destination and destination + 1) with the 
two successive words specified by the address. The destination 
register must be the even register of the pair. When the address is 
not on a word boundary, the machine signals an address error ex- 
ception. NOTE: For compatibility with future machines, we rec- 
ommend the use of double word alignment for all double word 
operands. 


Load Halfword (lh) 


Loads the two least significant bytes of the destination register 
with the contents of the halfword that is at the memory location 
specified by the effective address. The machine treats the loaded 
halfword as a signed value. If the effective address is not even, 
the machine signals an address error exception. 


Load Halfword Unsigned (ihu) 


Loads the least significant bits of the destination register with the 
contents of the halfword that is at the memory location specified 
by the effective address. Because the machine treats the loaded 
halfword as an unsigned value, it fills the two most significant 
bytes of the destination register with zeros. If the effective ad- 
dress is not even, the machine signals an address error exception. 


Load Immediate (li) 


Loads the destination register with the value of an expression that 
can be computed at assembly time. 




NOTE: Load Immediate can generate any efficient code se- 
quence to put a desired value in the register. 


Load Upper Immediate (lui) 


Loads the most significant half of a register with the expression's 
value, The machine fills the least significant half of the register 
with zeros. The expression's value must be in the range 
-32768,.,65535. 
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Table 5.2 Load Instruction Descriptions (continued) 



Instruction Name 



Description 



Load Word (lw) 



Load Word Left Owl) 



Load Word Right (lwr) 



Unaligned Load Halfword (ulh) 



Unaligned Load Halfword 
Unsigned (ulhu) 



Unaligned Load Word (ulw) 



Loads the destination register with the contents of the word that is 
at the memory location. The machine replaces all bytes of the reg- 
ister with the contents of the loaded word. 
The machine signals an address error exception when the effec- 
tive address is not divisible by four. 

Loads the sign— that is, Load Word Left loads the destination 
register with the most significant bytes of the word specified by 
the effective address. The effective address must specify the 
byte containing the sign. In a big-endian machine, the effective 
address specifies the lowest numbered byte, and in a little-en- 
dian machine the effective address specifies the highest num- 
bered byte. 

Only the bytes which share the same aligned word in memory are 
merged into the destination register. 

Loads the lowest precision bytes — that is, Load Word Right 
loads the destination register with the least significant bytes of the 
word specified by the effective address. The effective address 
must specify the byte containing the least significant bits. In a 
big-endian machine, the effective address specifies the highest 
numbered byte, and in a little-endian machine the effective ad- 
dress specifies the lowest numbered byte. 

Only the bytes which share the same aligned word in memory are 
merged into the destination register. 

Loads a halfword into the destination register from the specified 
address and extends the sign of the halfword. Unaligned Load 
Halfword loads a halfword regardless of the halfword' s align- 
ment in memory. 

Loads a halfword into the destination register from the specified 
address and zero extends the halfword. Unaligned Load 
Halfword Unsigned loads a halfword regardless of the 
halfword' s alignment in memory. 

Loads a word into the destination register from the specified ad- 
dress. Unaligned Load Word loads a word regardless of the 
word's alignment in memory. 
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Store Instruction Descriptions 



For all machine store instructions, the effective address is the 32-bit twos-com- 
plement sum of the contents of the index-register and the (sign-extended) 16-bit 
offset The assembler supports additional store instructions, which can produce 
multiple machine instructions. Instructions that have symbolic labels imply an 
index-register, which the assembler determines. 

Table 5.3 Store Instruction Descriptions 



c 



Instruction Name 


Description 


Store Byte (sb) 


Stores the contents of the source register's least significant 
byte in the byte specified by the effective address. 


Store Halfword (sh) 


Stores the two least significant bytes of the source register in 
the halfword that is at the memory location specified by the 
effective address. The effective address must be divisible by 
two, otherwise the machine signals an address error exception. 


Store Word (sw) 


Stores the contents of a word from the source register in the 
memory location specified by the effective address. The effec- 
tive address must be divisible by four, otherwise the machine 
signals an address error exception. 


Store Double (sd) 


Stores the contents of the register pair in successive words, 
which the address specifies. The source register must be the 
even register of the pair, and the storage address must be word 
aligned. 




NOTE: For compatibility with future machines, we recom- 
mend that you use double word alignment. 


Store Word Left (swl) 


Stores the most significant bytes of a word in the memory lo- 
cation specified by the effective address. The contents of the 
word at the memory location, specified by the effective ad- 
dress, are shifted right so that the leftmost byte of the un- 
aligned word is in the addressed byte position. The stored 
bytes replace the corresponding bytes of the effective address. 
The effective address's last two bits determine how many 
bytes are involved. 



c 



c 
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Table 5.3 Store Instruction Description (continued) 



Instruction Name 


Description 


Store Word Right (swr) 

Unaligned Store Halfword (ush) 
Unaligned Store Word (usw) 


Stores the least significant bytes of a word in the memory 
location specified by the effective address. The contents of 
the word at the memory location, specified by the effective 
address, are shifted left so that the right byte of the unaligned 
word is in the addressed byte position. The stored bytes re- 
place the corresponding bytes of the effective address. The 
effective address's last two bits determine how many bytes 
are involved. 

Stores the contents of the two least significant bytes of the 
source register in a halfword that the address specifies. The 
machine does not require alignment for the storage address. 

Stores the contents of the source register in a word specified 
by the address. The machine does not require alignment for 
the storage address. 



Computational Instructions 

The machine has general-purpose and coprocessor-specific computational in- 
structions (for example, the floating point coprocessor). This part of the book 
describes general-purpose computational instructions. 

Computational Formats 

In the Table 5.4 , operands have the following meanings: 



Operand 


Description 


destination/srcl 


the destination register is 
also source register 1 


destination 


the destination register 


immediate 


the immediate value 


srcl,src2 


the source registers 
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Table 5.4 Computational Instruction Formats 



Description 


Op-code 


Operand 


Add (with overflow) 

Add (without overflow) 

AND 

Divide (signed) 

Divide (unsigned) 

EXCLUSIVE OR 

Multiply 

Multiply with Overflow 

Multiply with Overflow Unsigned 

NOT OR 

OR 

Set Equal 

Set Greater 

Set Greater/Equal 

Set Greater/Equal Unsigned 

Set Greater Unsigned 

Set Less 

Set Less/Equal 

Set Less/Equal Unsigned 

Set Less Unsigned 

Set Not Equal 

Subtract (with overflow) 

Subtract (without overflow) 


add 

addu 

and 

div 

divu 

xor 

mul 

mulo 

mulou 

nor 

or 

seq 

sgt 

sge 

sgeu 

sgtu 

sit 

sle 

sleu 

situ 

sne 

sub 

subu 


destination,srcl,src2 
destination/srcl ,src2 

destinations 1, 

immediate 
destination/srcl, 

immediate 


Remainder (signed) 
Remainder (unsigned) 
Rotate Left 
Rotate Right 
Shift Right Arithmetic 
Shift Left Logical 
Shift Right Logical 


rem 

remu 

rol 

ror 

sra 

sli 

srl 


destination,srcl,src2 
destination/srcl, src2 
destinations 1, 

immediate 
destination/srcl, 

immediate 


Absolute Value 
Negate (with overflow) 
Negate (without overflow) 
NOT 

Move 


abs 
neg 
negu 
not 

move 


destinations 1 
destination/srcl 

destinations! 


Multiply 
Multiply (unsigned) 


mult 
multu 


srcl,src2 



c 
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Computational Instruction Descriptions 

Table 5.5 Computational Instruction Descriptions 



Instruction Name 



Description 



Absolute Value (abs) 



Add (with overflow) (add) 



Add (without overflow) (addu) 



Divide (signed) (div) 



Computes the absolute value of the contents of srcl and puts the 
result in the destination register. If the value in srcl is 
-2147483648, the machine signals an overflow exception. 

Computes the twos complement sum of two signed values. This 
instruction adds the contents of srcl to the contents of src2, or it 
can add the contents of srcl to the immediate value. Add (with 
overflow) puts the result in the destination register. When the 
result cannot be extended as a 32-bit number, the machine sig- 
nals an overflow exception. 

Computes the twos complement sum of two 32-bit values. This 
instruction adds the contents of srcl to the contents of src2, or it 
can add the contents of srcl to the immediate value. Add (with- 
out overflow) puts the result in the destination register. Over- 
flow exceptions never occur. 

AND (and) Computes the Logical AND of two values. This 
instruction ANDs (bit-wise) the contents of srcl with the con- 
tents of src2, or it can AND the contents of srcl with the imme- 
diate value. The immediate value is not sign extended. AND 
puts the result in the destination register. 

Computes the quotient of two values. Divide (with overflow) 
treats srcl as the dividend. The divisor can be src2 or the imme- 
diate value. The instruction divides the contents of srcl by the 
contents of src2, or it can divide srcl by the immediate value. It 
puts the quotient in the destination register. If the divisor is zero, 
the machine signals an error and may issue a break instruction. 
The div instruction rounds toward zero. Overflow is signaled 
when dividing -2147483648 by -1. The machine may issue a 
break instruction for divide-by-zero or for overflow. 

NOTE: The special case 
div $0,srcl,src2 

generates the real machine divide instruction and leaves the 
result in the hi/lo register. The hi register contains the remain- 
der and the lo register contains the quotient. No checking for 
divide by zero is performed. 
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Table 5.5 Computational Instruction Descriptions (continued) 



Instruction Name 



Divide (unsigned) (divu) 



EXCLUSIVE OR (xor) 



Move (move) 
Multiply (mul) 



Multiply (mult) 



Multiply Unsigned (multu) 



Description 



Computes the quotient of two unsigned 32-bit values. Divide 
(without overflow) treats srcl as the dividend. The divisor can 
be src2 or the immediate value. This instruction divides the 
contents of srcl by the contents of src2, or it can divide the con- 
tents of srcl by the immediate value. Divide (without over- 
flow) puts the quotient in the destination register. If the divisor 
is zero, the machine signals an exception and may issue a break 
instruction. 

See the note for div concerning $0 as a destination. 
Overflow exceptions never occur. 

Computes the XOR of two values. This instruction XORs (bit- 
wise) the contents of srcl with the contents of src2, or it can XOR 
the contents of srcl with the immediate value. The immediate 
value is not sign extended. EXCLUSIVE OR puts the result in 
the destination register. 

Moves the contents of srcl to the destination register. 

Computes the product of two values. This instruction puts the 
32-bit product of srcl and src2, or the 32-bit product of srcl and 
the immediate value, in the destination register. The machine 
does not report overflow. 

NOTE: Use mul when you do not need overflow protection: 
it's often faster than mulo and mulou. For multiplication by a 
constant, the mul instruction produces faster machine instruction 
sequences than mult or multu instructions can produce. 

Computes the 64-bit product of two 32-bit signed values. This 
instruction multiplies the contents of srcl by the contents of src2 
and puts the result in the hi and lo registers (see Chapter 1). No 
overflow is possible. 

NOTE: The mult instruction is a real machine-language in- 
struction 

Computes the product of two unsigned 32-bit values. It mul- 
tiplies the contents of srcl and the contents of src2 and puts 
the result in the hi and lo registers (see Chapter 1). No over- 
flow is possible. 

NOTE: The multu instruction is a real machine language 
instruction. 



c 
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Table 55 Computational Instruction Descriptions (continued) 



Instruction Name 



Description 



Multiply with Overflow (mulo) 



Multiply with Overflow Unsigned 

(mulou) 



Negate (with overflow) (neg) 



Negate (without overflow) 

(negu) 

NOT (not) 



NOT OR (nor) 



OR (or) 



Computes the product of two 32-bit signed values. Multiply with 
Overflow puts the 32-bit product of srcl and src2, or the 32-bit 
product of srcl and the immediate value, in the destination regis- 
ter. When a overflow occurs, the machine signals an overflow 
exception and may execute a break instruction. 

NOTE: For multiplication by a constant, mulo produces faster 
machine instruction sequences than mult or multu can produce; 
however, if you do not need overflow detection, use the mul in- 
struction. It's often faster than mulo. 

Computes the product of two 32-bit unsigned values. Multiply 
with Overflow Unsigned puts the 32-bit product of srcl and 
src2, or the product of srcl and the immediate value, in the des- 
tination register. This instruction treats the multiplier and mul- 
tiplicand as 32-bit unsigned values. When an overflow occurs, 
the machine signals an overflow exception and may issue an 
break instruction. 

NOTE: For multiplication by a constant, mulou produces 
faster machine instruction sequences than mult or multu can 
produce; however, if you do not need overflow detection, use 
the mul instruction. It's often faster than mulou. 

Computes the negative of a value. This instruction negates the 
contents of srcl and puts the result in the destination register. If 
the value in srcl is -2147483648, the machine signals an over- 
flow exception. 

Negates the integer contents of srcl and puts the result in the des- 
tination register. The machine does not report overflows. 

Computes the Logical NOT of a value. This instruction comple- 
ments (bit-wise) the contents of srcl and puts the result in the 
destination register. 

Computes the NOT OR of two values. This instruction combines 
the the contents of srcl with the contents of src2 (or the immedi- 
ate value). NOT OR complements the result and puts it in the 
destination register. 

Computes the Logical OR of two values. This instruction ORs 
(bit-wise) the contents of srcl with the contents of src2, or it can 
OR the contents of srcl with the immediate value. The immediate 
value is not sign extended. Or puts the result in the destination 
register. 
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Table 5,5 Computational Instruction Descriptions (continued) 



Instruction Name 



Description 



Remainder (signed) (rem) 



Remainder (unsigned) (remu) 



Rotate Left (rol) 



Rotate Right (ror) 



Set Equal (seq) 



Computes the remainder of the division of two unsigned 32-bit 
values. The machine defines the remainder rem(ij) as 
Mj*div(ij)) where j • 0. Remainder (with overflow) treats srcl 
as the dividend. The divisor can be src2 or the immediate value. 
This instruction divides the contents of srcl by the contents of 
src2, or it can divide the contents of srcl by the immediate value. 
It puts the remainder in the destination register. The rem instruc- 
tion rounds toward zero, rather than toward negative infinity. For 
example, div(5,-3)=-l, and rem(5,-3)=2. For divide-by-zero, 
the machine signals an error and may issue a break instruction. 

Computes the remainder of the division of two unsigned 32-bit 
values. The machine defines the remainder rem(ij) as 
i-(pdiv(i j)) where j • 0. Remainder Unsigned treats srcl as 
the dividend. The divisor can be src2 or the immediate value. 
This instruction divides the contents of srcl by the contents of 
src2, or it can divide the contents of srcl by the immediate 
value. Remainder Unsigned puts the remainder in the destina- 
tion register. For divide by zero, the machine signals an error 
and may issue a break instruction. 

Rotates the contents of a register left (toward the sign bit). This 
instruction inserts in the least significant bit any bits that were 
shifted out of the sign bit. The contents of srcl specify the 
value to shift, and the contents of src2 (or the immediate value) 
specify the amount to shift. Rotate Left puts the result in the 
destination register. If src2 (or the immediate value) is greater 
than 31, srcl shifts by (src2 MOD 32). 

Rotates the contents of a register right (toward the least signifi- 
cant bit). This instruction inserts in the sign bit any bits that 
were shifted out of the least significant bit. The contents of 
srcl specify the value to shift, and the the contents of src2 (or 
the immediate value) specify the amount to shift. Rotate 
Right puts the result in the destination register. If src2 (or the 
immediate value) is greater than 32, srcl shifts by src2 MOD 
32. 

Compares two 32-bit values. If the contents of srcl equal the 
contents of src2 (or srcl equals the immediate value) this in- 
struction sets the destination register to one; otherwise, it sets 
the destination register to zero. 



c 



5-12 



Assembly Language Programmer's Guide 



Instruction Set 



Table 5.5 Computational Instruction Descriptions (continued) 



Instruction Name 



Description 



Set Greater (sgt) 



Set Greater/Equal (sge) 



Set Greater/Equal Unsigned 

(sgeu) 



Set Greater Unsigned (sgtu) 



Set Less (sit) 



Set Less/Equal (sle) 



Set Less/Equal Unsigned (sleu) 



Set Less Unsigned (situ) 



Compares two signed 32-bit values. If the contents of srcl are 
greater than the contents of src2 (or srcl is greater than the im- 
mediate value), this instruction sets the destination register to 
one; otherwise, it sets the destination register to zero. 

Compares two signed 32-bit values. If the contents of srcl are 
greater than or equal to the contents of src2 (or srcl is greater 
than or equal to the immediate value), this instruction sets the 
destination register to one; otherwise, it sets the destination reg- 
ister to zero. 

Compares two unsigned 32-bit values. If the contents of srcl 
are greater than or equal to the contents of src2 (or srcl is greater 
than or equal to the immediate value), this instruction sets the 
destination register to one; otherwise, it sets the destination reg- 
ister to zero. 

Compares two unsigned 32-bit values. If the contents of srcl 
are greater than the contents of src2 (or srcl is greater than the 
immediate value), this instruction sets the destination register to 
one; otherwise, it sets the destination register to zero. 

Compares two signed 32-bit values. If the contents of srcl are 
less than the contents of src2 (or srcl is less than the immediate 
value), this instruction sets the destination register to one; other- 
wise, it sets the destination register to zero. 

Compares two signed 32-bit values. If the contents of srcl are 
less than or equal to the contents of src2 (or srcl is less than or 
equal to the immediate value), this instruction sets the destina- 
tion register to one; otherwise, it sets the destination register to 
zero. 

Compares two unsigned 32-bit values. If the contents of srcl 
are less than or equal to the contents of src2 (or srcl is less than 
or equal to the immediate value) this instruction sets the desti- 
nation register to one; otherwise, it sets the destination register 
to zero. 

Compares two unsigned 32-bit values. If the contents of srcl 
are less than the contents of src2 (or srcl is less than the im- 
mediate value), this instruction sets the destination register to 
one; otherwise, it sets the destination register to zero. 
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Table 5.5 Computational Instruction Descriptions (continued) 



( 



Instruction Name 



Description 



Set Not Equal (sne) 



Shift Left Logical (sll) 



Shift Right Arithmetic (sra) 



Shift Right Logical (srl) 



Subtract (with overflow) (sub) 



Subtract (without overflow) 

(subu ) 



Compares two 32-bit values. If the contents of scrl do not 
equal the contents of src2 (or srcl does not equal the immedi- 
ate value), this instruction sets the destination register to one; 
otherwise, it sets the destination register to zero. 

Shifts the contents of a register left (toward the sign bit) and in- 
serts zeros at the least significant bit. The contents of srcl specify 
the value to shift, and the contents of src2 or the immediate value 
specify the amount to shift. If src2 (or the immediate value) is 
greater than 31 or less than 0, srcl shifts by src2 MOD 32. 

Shifts the contents of a register right (toward the least signifi- 
cant bit) and inserts the sign bit at the most significant bit. The 
contents of srcl specify the value to shift, and the contents of 
src2 (or the immediate value) specify the amount to shift. If 
src2 (or the immediate value) is greater than 31 or less than 0, 
srcl shifts by the result of src2 MOD 32. 

Shifts the contents of a register right (toward the least signifi- 
cant bit) and inserts zeros at the most significant bit. The con- 
tents of srcl specify the value to shift, and the contents of src2 
(or the immediate value) specify the amount to shift. If src2 (or 
the immediate value) is greater than 31 or less than 0, srcl shifts 
by the result of src2 MOD 32. 

Computes the twos complement difference for two signed val- 
ues. This instruction subtracts the contents of src2 from the 
contents of srcl , or it can subtract the contents of the immediate 
from the srcl value. Subtract puts the result in the destination 
register. When the true result's sign differs from the destination 
register's sign, the machine signals an overflow exception. 

Computes the twos complement difference for two 32-bit val- 
ues. This instruction subtracts the contents of src2 from the 
contents of srcl, or it can subtract the contents of the immediate 
from the srcl value. Subtract Unsigned puts the result in the 
destination register. Overflow exceptions never happen. 



c 



The jump and branch instructions let you change an assembly program's control 
flow. 
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Jump and Branch Formats 

In Table 5.6 below, the operands have the following meanings: 



Operand 


Description 


address 


an expression 


srcl,src2 


the source registers 


target 


register containing the target 


label 


a symbol label 


return 


register containing the return address 


immediate 


an expression with an absolute value 



Table 5.6 Jump and Branch Instruction Formats 



Description 


Op-code 


Operand 


Jump 


J 


address 


Jump and Link 


jal 


address 

target 

return,target 


Branch on Equal 

Branch on Greater 

Branch on Greater/Equal 

Branch on Greater/Equal Unsigned 

Branch on Greater Unsigned 

Branch on Less 

Branch on Less/Equal 

Branch on Less/Equal Unsigned 

Branch on Less Unsigned 

Branch on Not Equal 


beq 

bgt 

bge 

bgeu 

bgtu 

bit 

ble 

bleu 

bltu 

bne 


srcl,src2,label 
srcl, immediate,label 


Branch on Equal to Zero 
Branch on Greater/Equal Zero 
Branch on Greater Than Zero 
Branch on Greater or Equal to Zero 
and Link 

Branch on Less Than Zero and Link 
Branch on Less/Equal Zero 
Branch on Less Than Zero 
Branch on Not Equal to Zero 


beqz 
bgez 
bgtz 

bgezal 

bltzal 

blez 

bltz 

bnez 


srcl, label 


Branch 
Branch and Link 


b 
bal 


label 
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Jump and Branch Instruction Descriptions 



In the following branch instructions, branch destinations must be defined in the 
source being assembled. 

Table 57 Jump and Branch Instruction Descriptions 



Instruction Name 


Description 


Branch (b) 


Branches unconditionally to the specified label. 


Branch and Link (bal) 


Branches unconditionally to the specified label and puts the re- 
turn address in general register $31. 


Branch on Equal (beq) 


Branches to the specified label when the contents of srcl equal 
the contents of src2, or it can branch when the contents of srcl 
equal the immediate value. 


Branch on Equal to Zero (beqz) 


Branches to the specified label when the contents of srcl equal 
zero. 


Branch on Greater (bgt) 


Branches to the specified label when the contents of srcl are 
greater than the contents of src2, or it can branch when the con- 
tents of srcl are greater than the immediate value. The compari- 
son treats the comparands as signed 32-bit values. 


Branch on Greater 
/Equal Unsigned (bgeu) 


Branches to the specified label when the contents of srcl are 
greater than or equal to the contents of src2, or it can branch 
when the contents of srcl are greater than or equal to the imme- 
diate value. The comparison treats the comparands as unsigned 
32-bit values. 


Branch on Greater 
/Equal Zero (bgez) 


Branches to the specified label when the contents of srcl are 
greater than or equal to zero. 


Branch on Greater 

/Equal Zero and Link (bgezal) 


Branches to the specified label when the contents of srcl are 
greater than or equal to zero and puts the return address in gen- 
eral register $31. When this write is done, it destroys the con- 
tents of the register. See the RISC Architecture book for more 
information. 


Branch on Greater or Equal 

(bge) 


Branches to the specified label when the contents of srcl are 
greater than or equal to the contents of src2, or it can branch 
when the contents of srcl are greater than or equal to the imme- 
diate value. The comparison treats the comparands as signed 
32-bit values. 


Branch on Greater 

Than Unsigned 

(bgtu) 


Branches to the specified label when the contents of srcl are 
greater than the contents of src2, or it can branch when the 
contents of srcl are greater than the immediate value. The 
comparison treats the comparands as unsigned 32-bit values. 
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Table 57 Jump and Branch Instruction Descriptions (continued) 



Instruction Name 



on Greater Than Zero 
Branch on Less (bit) 



Branch 

(bgtz) 



Branch on Less/Equal Unsigned 

(bleu) 



Branch on Less/Equal Zero 

(blez) 

Branch on Less or Equal (ble) 



Branch on Less Than Unsigned 

(bltu) 



Branch on Less Than Zero 

(bltz) 

Branch on Less Than 
Zero and Link (bltzal) 



Branch on Not Equal (bne) 



Branch on Not Equal to Zero 

(bnez) 



Description 



Branches to the specified label when the contents of srcl are 
greater than zero. 

Branches to the specified label when the contents of srcl are less 
than the contents of src2, or it can branch when the contents of 
srcl are less than the immediate value. The comparison treats the 
comparands as signed 32-bit values. 

Branches to the specified label when the contents of srcl are less 
than or equal to the contents of src2, or it can branch when the 
contents of srcl are less than or equal to the immediate value. 
The comparison treats the comparands as unsigned 32-bit values. 

Branches to the specified label when the contents of srcl are 
less than or equal to zero. The program must define the destina- 
tion. 

Branches to the specified label when the contents of srcl are 
less than or equal to the contents of src2, or it can branch when 
the contents of srcl are less than or equal to the immediate 
value. The comparison treats the comparands as signed 32-bit 
values. 

Branches to the specified label when the contents of srcl are 
less than the contents of src2, or it can branch when the con- 
tents of srcl are less than the immediate value. The compari- 
son treats the comparands as unsigned 32-bit values. 

Branches to the specified label when the contents of srcl are less 
than zero. The program must define the destination. 

Branches to the specified label when the contents of srcl are less 
than zero and puts the return address in general register $31. 
Because the value is always stored in register 31, there is a 
chance of a stored value being overwritten before it is used. See 
the RISC Architecture book for more information. 

Branches to the specified label when the contents of srcl do not 
equal the contents of src2, or it can branch when the contents of 
srcl do not equal the immediate value. 

Branches to the specified label when the contents of srcl do not 
equal zero. 
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Table 57 Jump and Branch Instruction Descriptions (continued) 



Instruction Name 


Description 


Jump (j) 

Jump And Link (jal) 


Unconditionally jumps to a specified location. A symbolic ad- 
dress or a general register specifies the destination. The instruc- 
tion j $31 returns from the a jal call instruction. 

Unconditionally jumps to a specified location and puts the return 
address in a general register. A symbolic address or a general 
register specifies the target location. By default, the return ad- 
dress is placed in register $31. If you specify a pair of registers, 
the first receives the return address and the second specifies the 
target. The instruction jal procname transfers to procname and 
saves the return address. 

For the two-register form of the instruction, the target register 
may not be the same as the return-address register. For the one- 
register form, the target may not be $31. 



( 



Special Instructions 

The main processor's special instructions do miscellaneous tasks. 
Special Formats 

In Table 5.8 , operands have the following meanings: 



( 



Operand 


Description 


register 
breakcode 


destination or source register 
value that determines the break type 



Table 5.9 Special Instruction Formats 



Description 


Op-code 


Operand 


Break 


break 


breakcode 


Restore From Exception 
Syscall 


rfe 
syscall 




Move From HI Register 
Move To HI Register 
Move From LO Register 
Move To LO Register 


mfhi 
mthi 
mflo 
mtio 


register 
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Special Instruction Descriptions 

Table 5 JO Special Instruction Descriptions 



Instruction Name 



Break (break) 

Move From HI Register (mfhi) 

Move From LO Register (mflo) 
Move To HI Register (mthi) 
Move To LO Register (mtlo) 
Restore From Exception (rfe) 

Syscall (syscall) 



Description 



Unconditionally transfers control to the exception handler. The 
breakcode operand is interpreted by software conventions. 

Moves the contents of the hi register to a general purpose register. 

Moves the contents of the lo register to a general purpose register. 

Moves the contents of a general purpose register to the hi register. 

Moves the contents of a general purpose register to the lo register. 

Restores the previous interrupt callee and user/kernel state. This 
instruction can execute only in kernel state and is unavailable in 
user mode. 

Causes a system call trap. The operating system interprets the 
information set in registers to determine what system call to do. 



Coprocessor Interface Instructions 

The coprocessor interface instructions provide standard ways to access the ma- 
chine's coprocessors.* 

Coprocessor Interface Formats 

In Table 5. 1 1 , the operands have the following meanings: 



Operand 



destination 

dest-gpr 

address 

source 

src-gpr 

operation 

label 



Description 



a coprocessor number in the range 0...3 

the destination coprocessor register 

the destination general register 

a symbolic expression 

a coprocessor register from which values are assigned 

a general register from which values are assigned 

the coprocessor specific operation 

a symbolic label 
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Table 5 .11 coprocessor Interface Instruction Formats 



Description 


Op-code 


Operand 


Load Word Coprocessor z 


iwcz 


destination,address 


Store Word Coprocessor z 


swcz 


source, address 


Move From Coprocessor z 
Move To Coprocessor z 


mfcz 
mtcz 


dest-gpr, source 
src-gpr, destination 


Branch Coprocessor z False 
Branch Coprocessor z True 


bczf 
bczt 


label 


Coprocessor z Operation 


cz 


expression 


Control From Coprocessor z 
Control To Coprocessor z 


cfcz 
ctcz 


dest-gpr, source 
src-gpr, destination 



( 



* You cannot use coprocessor load and store instructions with the system control coproce- 
ssor (cpO). 

Coprocessor Interface Instruction Descriptions 

Table 5.12 Coprocessor Interface Instruction Descriptions 



c 



Instruction Name 


Description 


Branch Coprocessor z True 

(bczt) 

Branch Coprocessor z False 

(bczf) 

Control From Coprocessor z 

(cfcz) 

Control To Coprocessor z 

(ctcz) 

Coprocessor z Operation 
(cz) 


Branches to the specified label when the specified coprocessor 
asserts a true condition. The z selects one of the coprocessors. A 
previous coprocessor operation sets the condition. 

Branches to the specified label when the specified coprocessor 
asserts a false condition. The z selects one of the coprocessors. A 
previous coprocessor operation sets the condition. 

Stores the contents of the coprocessor control register specified 
by the source in the general register specified by dest-gpr. 

Stores the contents of the general register specified by src-gpr in 
the coprocessor control register specified by the destination. 

Executes a coprocessor-specific operation on the specified cop- 
rocessor. The z selects one of four distinct coprocessors. 
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Table 5.12 Coprocessor Interface Instruction Descriptions (continued) 



Instruction Name 


Description 


Load Word Coprocessor z 

(lwcz) 

Move From Coprocessor z 

(mfcz) 

Move To Coprocessor z 

(mtcz) 

Store Word Coprocessor z 

(swcz) 


Loads the destination with the contents of a word that is at the 
memory location specified by the effective address. The z selects 
one of four distinct coprocessors. Load Word Coprocessor re- 
places all register bytes with the contents of the loaded word. If 
bits and 1 of the effective address are not zero, the machine sig- 
nals an address exception. 

Stores the contents of the coprocessor register specified by the 
source in the general register specified by dest-gpr. 

Stores the contents of the general register specified by src-gpr 
in the coprocessor register specified by the destination. 

Stores the contents of the coprocessor register in the memory lo- 
cation specified by the effective address. The z selects one of four 
distinct coprocessors. If bits and 1 of the effective address are 
not zero, the machine signals an address error exception. 
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6 
Coprocessor Instruction Set 



This chapter describes the coprocessor instructions for these coprocessors: 

• system control coprocessor (cpO) instructions 

• floating point coprocessor instructions 

See Chapter NO TAG for a description of the main processor's instructions and 
the coprocessor interface instructions. 



Instruction Notation 



The tables in this chapter list the assembler format for each coprocessor's load, 
store, computational, jump, branch, and special instructions. The format consists 
of an op-code and a list of operand formats. The tables list groups of closely re- 
lated instructions; for those instructions, you can use any op-code with any 
specified operand. 

NOTE: The system control coprocessor instructions do not have operands. Op- 
erands can have any of these formats: 

• memory references — for example a relocatable symbol +/- an 
expression(register) 

• expressions (for immediate values) 

• two or three operands — for example, add $3,$4 is the same as 
add $3,$3,$4 

The following terms are used to discuss floating point operations: 

• infinite — A value of +1 or -1. 

• infinity — A symbolic entity that represents values with magni- 
tudes greater than the largest value in that format. 

• ordered — The usual result from a comparison, namely: <,=, or > 

• NaN — Symbolic entities that represent values not otherwise 
available in floating point formats. There are two kinds of NaNs. 
Quiet NaNs represent unknown or uninitialized values. Signal- 
ing NaNs represent symbolic values and values that are too big 
or too precise for the format. Signaling NaNs raise an invalid 
operation exception whenever an operation is attempted on them. 
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• unordered— The condition that results from a floating-point 
comparison when one or both operands are NaNs. 

Floating Point Instructions 

The floating point coprocessor has these classes of instructions: 

Load and Store Instructions. Load values and move data be- 
tween memory and coprocessor registers. 

Move Instructions. Move data between registers. 

Computational Instructions. Do arithmetic and logical opera- 
tions on values in coprocessor registers. 

Relational Instructions. Compare two floating point values. 

A particular floating point instruction may be implemented in hardware, soft- 
ware, or a combination of hardware and software. 

Floating Point Formats 

The formats for the single and double precision floating point constants are 
shown below. 



c 



1 89 



1 


8 bits 


23 bits 



31 30 23 22 

Single Precision 



1 



1112 



11 bits 



63 62 52 51 



31 (big-endian) 



(little-endian) 



52 bits 



Double Precision 



(big-endian) 
63 




(little-endian) 



( 



Figure 6.1 Floating Point Formats 
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Floating Point Load and Store Formats 



Floating point load and store instructions must use even registers. The operands 
inTable 6.1 have the following meanings: 



Operand 


Meaning 


destination 

address 

source 


the destination register 

offset (base) 

the source register 



Table 6 J Floating Point Load and Store Formats 



Description 


Op-code 


Operand 


Load Fp 

Double 
Single 


l.d 

1.S 


destination, address 


Load Immediate Fp 

Double 

Single 


li.d 
li.s 


destination, floating point constant 


Store Fp 

Double 
Single 


s.d 

s.s 


source, address 



Floating Point Load and Store Descriptions 



This part of Chapter 6 groups the instructions by function. Please consult 
Table 6.1 for the op-codes. 

Table 62 Floating Point Load and Store Descriptions 



Instruction 


Description 


Load Fp Instructions 
Store Fp Instructions 


Load eight bytes for double precision and four bytes for single 
precision from the specified effective address into the destination 
register, which must be an even register. The bytes must be 
word aligned. NOTE: To ensure compatibility with future ma- 
chines, we recommend that you use double word alignment for 
double precision operands. 

Stores eight bytes for double precision and four bytes for single 
precision from the source floating point register in the destina- 
tion register, which must be an even register. NOTE: To en- 
sure compatibility with future machines, we recommend that 
you use double word alignment for double precision operands. 
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Floating Point Computational Formats 



This part of Chapter 6 describes floating point computational instructions. The 
operands in Table 6.3 below have the following meaning: 



c 



Operand 


Meaning 


destination 

source 

gpr 


the destination register 
the source register 
general purpose register 



Table 63 Floating Point Computational Instruction Formats 



Description 


Op-code 


Operand 


Absolute Value Fp 

Double 

Single 
Negate Fp 
Double 

Single 


abs.d 
abs.s 

neg.d 
neg.s 


destination, srcl 


AddFp 

Double 

Single 
Divide Fp 

Double 

Single 
Multiply Fp 

Double 

Single 
Subtract Fp 

Double 

Single 


aekld 
add.s 

div.d 
div.s 

mul.d 
muLs 

sub.d 
sub.s 


destination, srcl, src2 


Conditional Trap 

Trap if Equal 
Trap if not Equal 

Trap if Less Than 

Trap if Less than, Unsigned 

Trap if Greater Than or Equal 

Trap if Greater than or Equal, Unsigned 


teq 
tne 

tit 
ltu 
tge 
tgeu 


destination, src 


Convert Source to 
Specified Fp Precision 

Double to Single Fp 
Fixed Point to Single Fp 
Single to Double Fp 
Fixed Point to Double Fp 
Single to Fixed Point Fp 
Double to Fixed Point Fp 


cvts.d 

cvts.w 

cvtd.s 

cvt.d.w 

cvt.w.s 

cvt.w.d 


destination, srcl 



( 
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Table 63 Floating Point Computational Instruction Formats (continued) 



Description 


Op-code 


Operand 


Truncate and Round 






Operations 






Truncate to Single Fp 
Truncate to Double Fp 


truncw.s 
trunc.w.d 




Round to Single Fp 
Round to Double Fp 


round.w.s 
round.w.d 




Ceiling to Double Fp 


ceil.w.d 




Ceiling to Single Fp 


ceil.w.s 




Ceiling to Double Fp, Unsigned 


ceilu.w.d 


destination, src, gpr 


Ceiling to Single Fp, Unsigned 


ceilu.w.s 




Floor to Double Fp 


floor.w.d 




Floor to Single Fp 


floor.w.s 




Floor to Double Fp, Unsigned 


flooru.w.d 




Floor to Single Fp, Unsigned 


flooru.w.s 




Round to Double Fp, Unsigned 


roundu.w.d 




Round to Single Fp, Unsigned 


roundu.w.s 




Truncate to Double Fp, Unsigned 


truncu.w.d 




Truncate to Single Fp, Unsigned 


truncu.w.s 





Floating Point Computational Instruction Descriptions 



This part of Chapter 6 groups the instructions by function. Refer to Table 6.3 
for the op-code names. 
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Table 6.4 Floating Point Computational Instruction Descriptions 



Instruction 



Description 



Absolute Value Fp Instructions 



Add Fp Single Instructions 



Convert Source to Another 
Precision Fp Instructions 

Truncate and Round Instructions 



Divide Fp Instructions 



Multiply Fp Instructions 



Compute the absolute value of the contents of srcl and 
put the specified precision floating point result in the 
destination register. 

Add the contents of srcl (or the destination) to the con- 
tents of src2 and put the result in the destination register. 
When the sum of two operands with opposite signs is 
exactly zero, the sum has a positive sign for all rounding 
modes except round toward -1. For that rounding 
mode, the sum has a negative sign. 

Convert the contents of srcl to the specified precision, 
round according to the rounding mode, and put the re- 
sult in the destination register. 

The trunc instructions truncate the value in the source 
floating-point register and put the resulting integer in 
the destination floating-point register, using the third 
(general-purpose) register to hold a temporary value. 
(This is a macro-instruction.) The round instructions 
work like trunc, but round the floating-point value to 
an integer instead of truncating it. 

Compute the quotient of two values. These instruc- 
tions treat srcl as the dividend and src2 as the divisor. 
Divide Fp instructions divide the contents of srcl by 
the contents of src2 and put the result in the destination 
register. If the divisor is a zero, the machine signals a 
error if the divide-by-zero exception is enabled. 

Multiplies the contents of srcl (or the destination) with 
the contents of srcl and puts the result in the destina- 
tion register. 



( 



c 



c 
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Table 6 A Floating Point Computational Instruction Descriptions (continued) 



Instruction 


Description 


Negate FP Instructions 
Subtract Fp Instructions 


Compute the negative value of the contents of srcl and 
put the specified precision floating point result in the 
destination register. 

Subtract the contents of src2 from the contents of srcl 
(or the destination). These instructions put the result in 
the destination register. When the difference of two 
operands with the same signs is exactly zero, the dif- 
ference has a positive sign for all rounding modes ex- 
cept round toward -1. For that rounding mode, the 
sum has a negative sign. 



Floating Point Relational Operations 



Table 6.5 summarizes the floating point relational instructions. The first column 
under Condition gives a mnemonic for the condition tested. As the "branch on 
true/false" condition can be used to logically negate any condition, the second 
column supplies a mnemonic for the logical negation of the condition in the first 
column. This provides a total of 32 possible conditions. The four columns under 
Relations give the result of the comparison based on each condition. The final 
column states if an invalid operation is signaled for each condition. 

For example, with an equal condition (EQ mnemonic in the True column), the 
logical negation of the condition is not equal (NEQ), and a comparison that is 
equal is True for equal and False for greater than, less than, and unordered, and 
no Invalid Operation Exception is given if the relation is unordered. 
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Table 65 Floating Point Relational Operators 



Condition 


Relations 


Invalid 
Operation 

Exception if 
Unordered 


Mnemonic 


Greater 
Than 


Less 
Than 


Equal 


Unordered 


True 


False 


F 

UN 

EQ 

UEQ 

OLT 

ULT 

OLE 

ULE 


T 

OR 
NEQ 
OLG 
UGE 
OGE 
UGT 
OGT 


F 
F 
F 
F 
F 
F 
F 
F 


F 
F 
F 
F 
T 
T 
T 
T 


F 
F 
T 
T 
F 
F 
T 
T 


F 
T 
F 
T 
F 
T 
F 
T 


no 
no 
no 
no 
no 
no 
no 
no 


SF 
NGLE 
SEQ 
NGL 
LT 
NGE 
LE 
NGT 


ST 
GLE 

SNE 
GL 
NLT 
GE 
NLE 
GT 


F 
F 
F 
F 
F 
F 
F 
F 


F 
F 
F 
F 
T 
T 
T 
T 


F 
F 
T 
T 
F 
F 
T 
T 


F 
T 
F 
T 
F 
T 
F 
T 


yes 
yes 
yes 
yes 
yes 
yes 
yes 
yes 



c 



The mnemonics in Table 6.5 have the following meanings: 



F 


False 


T 


True 


UN 


Unordered 


OR 


Ordered 


EQ 


Equal 


NEQ 


Not Equal 


UEQ 


Unordered or Equal 


OLG 


Ordered or Less Than 
or Greater Than 


OLT 


Ordered Less Than 


UGE 


Unordered or Greater 
Than or Equal 


ULT 


Unordered or Less 
Than 


OGE 


Ordered Greater Than 


OLE 


Ordered Less Than 


UGT 


Unordered or Greater 




or Equal 




Than 


ULE 


Unorderd or Less 
Than or Equal 


OGT 


Ordered Greater Than 


SF 


Signaling False 


ST 


Signaling True 


NGLE 


Not Greater Than or 


GLE 


Greater Than, or 




Less Than or Equal 




Less Than or Equal 


SEQ 


Signaling Equal 


SNE 


Signaling Not Equal 


NGL 


Not Greater than or 
Less Than 


GL 


Greater Than or Less Than 


LT 


Less Than 


NLT 


Not Less Than 


NGE 


Not Greater Than 
or Equal 


GE 


Greater Than or Equal 


LE 


Less Than or Equal 


NLE 


Not Less Than or Equal 


NGT 


Not Greater Than 


GT 


Greater Than 



( 



c 
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To branch on the result of a relational: 

/* branching on a compare result */ 

c.eq.s $fl,$f2 /* compare the single precision values */ 
belt true /* if $fl equals $f2, branch to true */ 
bclf false /* if $fl does not equal $f2, branch to */ 
/* false */ 

Floating Point Relational Formats 

In the table below, srcl and src2 refer to the source registers. 

NOTE: These are the most common Compare instructions. The machine pro- 
vides other Compare instructions for IEEE compatibility. 

Table 6.6 Floating Point Relational Instruction Formats 



Description 


Op-code 


Operand 


Compare F 






Double 


cf.d 


srcl,src2 


Single 


cf.s 




Compare UN 






Double 


c.un.d 




Single 


c.un.s 




♦Compare EQ 






Double 


ceq.d 




Single ^ 


ceq.s 




Compare UEQ 






Double 


c.ueq.d 




Single 


c.ueq.s 




Compare OLT 






Double 


c.olt.d 




Single 


colts 




Compare ULT 






Double 


cultd 




Single 


cults 




Compare OLE 






Double 


cole.d 




Single 


coles 




Compare ULE 






Double 


cule.d 




Single 


cule.s 




Compare SF 






Double 


csf.d 




Single 


csf.s 
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Table 6.6 Floating Point Relational Instruction Formats (continued) 



( 



Description 


Op-code 


Operand 


Compare NGLE 


cngle.d 




Double 


srcl,src2 


Single 


cngteoS 




Compare SEQ 






Double 


cseq.d 




Single 


c.seq.s 




Compare NGL 






Double 


cjrigLd 




Single 


CoHghS 




*Compare LT 


clt d 




Double 






Single 


cits 




Compare NGE 


c.nge.d 




Double 




Single 


c.ngejs 




*Compare LE 






Double 


c.le.d 




Single 


cae.s 




Compare NGT 
Double 


cngtd 




Single 


c.ngtoS 





c 



Floating Point Relational Instruction Descriptions 



This part of Chapter 6 describes the relational instruction descriptions by func- 
tion. Refer to Chapter 1 for information regarding registers. Please consult 
Table 6.6 for the op-code names. 

Table 6 J Floating Point Relational Instruction Descriptions 



Instruction 



Description 



Compare EQ Instructions 



Compare F Instructions 



Compare LE Instructions 



Compare the contents of srcl with the contents of src2. If 
srcl equals src2 a true condition results; otherwise, a false 
condition results. The machine does not signal an excep- 
tion for unordered values. 

Compare the contents of srcl with the contents of src2. 
These instructions always produce a false condition. The 
machine does not signal an exception for unordered val- 
ues. 

Compare the contents of srcl with the contents of src2. If 
srcl is less than or equal to src2, a true condition results; 
otherwise, a false condition results. The machine signals 
an exception for unordered values. 
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Table 67 Floating Point Relational Instruction Descriptions (continued) 



Instruction 



Description 



Compare LT Instructions 



Compare NGE Instructions 



Compare NGL Instructions 



Compare NGLE Instructions 



Compare NGT Instructions 



Compare OLE Instructions 



Compare OLT Instructions 



Compare SEQ Instructions 



Compare SF Instructions 



Compare the contents of srcl with the contents of src2. If 
srcl is less than src2, a true condition results; otherwise, a 
false condition results. The machine signals an exception 
for unordered values. 

Compare the contents of srcl with the contents of src2. If 
srcl is less than src2 (or the contents are unordered), a true 
condition results; otherwise, a false condition results. The 
machine signals an exception for unordered values. 

Compare the contents of srcl with the contents of src2. If 
srcl equals src2 or the contents are unordered, a true condi- 
tion results; otherwise, a false condition results. The ma- 
chine signals an exception for unordered values. 

Compare the contents of srcl with the contents of src2. If 
srcl is unordered, a true condition results; otherwise, a 
false condition results. The machine signals an exception 
for unordered values. 

Compare the contents of srcl with the contents of src2. If 
srcl is less than or equal to src2 or the contents are unor- 
dered, a true condition results; otherwise, a false condition 
results. The machine signals an exception for unordered 
values. 

Compare the contents of srcl with the contents of src2. If 
srcl is less than or equal to src2, a true condition results; 
otherwise, a false condition results. The machine does not 
signal an exception for unordered values. 

Compare the contents of srcl with the contents of src2. If 
srcl is less than src2, a true condition results; otherwise, a 
false condition results. The machine does not signal an 
exception for unordered values. 

Compare the contents of srcl with the contents of src2. If 
srcl equals src2, a true condition results; otherwise, a 
false condition results. The machine signals an exception 
for unordered values. 

Compare the contents of srcl with the contents of src2. 
This always produces a false condition. The machine sig- 
nals an exception for unordered values. 
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Table 67 Floating Point Relational Instruction Descriptions (continued) 



Instruction 



Description 



Compare ULE Instructions 



Compare UEQ Instructions 



Compare ULT Instructions 



Compare UN Instructions 



Compare the contents of srcl with the contents of src2. If 
srcl is less than or equal to src2 (or srcl is unordered), a 
true condition results; otherwise, a false condition results. 
The machine does not signal an exception for unordered 
values. 

Compare the contents of srcl with the contents of src2. If 
srcl equals src2 (or srcl and src2 are unordered), a true 
condition results; otherwise, a false condition results. The 
machine does not signal an exception for unordered values. 

Compare the contents of srcl with the contents of src2. If 
srcl is less than src2 (or the contents are unordered), a true 
condition results; otherwise, a false condition results. The 
machine does not signal an exception for unordered val- 
ues. 

Compare the contents of srcl with the contents of src2. If 
either srcl or src2 is unordered, a true condition results; 
otherwise, a false condition results. The machine does not 
signal an exception for unordered values. 



Floating Point Move Formats 

The floating point coprocessor's move instructions move data from source to 
destination registers (only floating point registers are allowed). 

Table 6.8 Floating Point Move Instruction Formats 



Description 


Op-code 


Operand 


Move Fp 

Double 
Single 


mov.d 
mov.s 


destination, srcl 



c 



c 



Floating Point Move Instruction Descriptions 

This part of Chapter 6 describes the floating point move instructions. Please 
consult Table 6.8 for the op-code names. 

Table 6.9 Floating Point Move Instruction Descriptions 



Instruction 


Description 


Move Fp Instructions 


Move the double or single precision contents of srcl to the 
destination register, maintaining the specified precision. 



( 
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System Control Coprocessor Instructions 

The system control coprocessor (cpO) handles all functions and special and privi- 
leged registers for the virtual memory and exception handling subsystems. The 
system control coprocessor translates addresses from a large virtual address 
space into the machine's physical memory space. The coprocessor uses a trans- 
lation lookaside buffer (TLB) to translate virtual addresses to physical addresses. 

System Control Coprocessor Formats 

These coprocessor system control instructions do not have operands: 
Table 6.10 System Control Instruction Formats. 



Description 


Op-code 


Translation Lookaside Buffer Probe 
Translation Lookaside Buffer Read 
Translation Lookaside Buffer Write Random 
Translation Lookaside Write Index 


tlbp 

Ubr 

tlbwr 

tlbwi 



System Control Coprocessor Instruction Descriptions 

This part of Chapter 6 describes the system control coprocessor instructions. 
Table 6.11 System Control Coprocessor Instruction Descriptions 



Instruction 



Translation Lookaside Buffer Probe 

(tlbp) 



Translation Lookaside Buffer Read 

(tlbr) 

Translation Lookaside Buffer 
Write Random 

(tlbwr) 

Translation Lookaside Buffer 
Write Index 

(tlbwi) 



Description 



Probes the translation lookaside buffer (TLB) to see if the TLB has 
an entry that matches the contents of the EntryHi register. If a 
match occurs, the machine loads the Index register with the num- 
ber of the entry that matches the EntryHi register. If no TLB entry 
matches, the machine sets the high-order bit of the Index register. 

Loads the EntryHi and EntryLo registers with the contents of the 
translation lookaside buffer (TLB) entry specified in the TLB Index 
register. 

Loads the specified translation lookaside buffer (TLB) entry with 
the contents of the EntryHi and EntryLo registers. The contents 
of the TLB Random register specify the TLB entry to be loaded. 

Loads the specified translation lookaside buffer (TLB) entry with 
the contents of the EntryHi and EntryLo registers. The contents 
of the TLB Index register specify the TLB entry to be loaded. 



Control and Status Register 

Floating-point coprocessor control register 31 contains status and control infor- 
mation. It controls the arithmetic rounding mode and the enabling of user-level 
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traps, and indicates exceptions that occurred in the most recently executed in- 
struction, and any exceptions that may have occurred without being trapped. 





31 24 23 


22 18 17 12 11 




7 6 2 


1 









c 





exceptions 


enables 


sticky- 
bits'' 


RM 




BITS: 8 1 


5 6 


5 


5 


2 






Control and Status Register 
(c = compare bit) 










11 10 9 8 7 
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V 


Z 
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E 


V Z 


U I 




V 


Z 


U I 






Enable Bits 






Exception Bits 






Sticky Bits 







( 



Figure 62 Floating Control and Status Register 31 

The exception bits are set for instructions that cause an IEEE standard exception 
or an optional exception used to emulate some of the more hardware-intensive 
features of the IEEE standard. 

The exception field is loaded as a side-effect of each floating-point operation 
(excluding loads, stores, and unformatted moves). The exceptions which were 
caused by the immediately previous floating-point operation can be determined 
by reading the exception field. 

The meaning of each bit in the exception field is given below. If two exceptions 
occur together on one instruction, the field will contain the inclusive OR of the 
bits for each exception. 



c 



Exception 
Field Bit 


Description 


E 

V 

z 
I 



u 


Unimplemented Operation 
Invalid Operation 
Division by Zero 
Inexact Exception 
Overflow Exception 
Underflow Exception 



The unimplemented operation exception is normally invisible to user-level code. 
It is provided to maintain IEEE compatibility for non-standard implementations. 

The five IEEE standard exceptions are listed below: 



( 
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Field 


Description 


V 


Invalid Operation 


z 


Division by Zero 


I 


Inexact Exception 





Overflow Exception 


u 


Underflow Exception 



Each of the five exceptions is associated with a trap under user control, which is 
enabled by setting one of the five bits of the enable field, shown above. 

When an exception occurs, both the corresponding exception and status bits are 
set. If the corresponding enable flag bit is set, a trap is taken. In some cases the 
result of an operation is different if a trap is enabled. 

The status flags are never cleared as a side effect of floating-point operations, but 
may be set or cleared by writing a new value into the status register, using a 
"move to coprocessor control" instruction. 

The floating-point compare instruction places the condition which was detected 
into the "c" bit of the control and status register, so that the state of the condition 
line may be saved and restored. The "c" bit is set if the condition is true, and 
cleared if the condition is false, and is affected only by compare and move to 
control register instructions. 

Exception Trap Processing 

For each IEEE standard exception, a status flag is provided that is set on any oc- 
currence of the corresponding exception condition with no corresponding excep- 
tion trap signaled. It may be reset by writing a new value into the status register. 
The flags may be saved and restored individually, or as a group, by software. 
When no exception trap is signaled, a default action is taken by the floating-point 
coprocessor, which provides a substitute value for the original, exceptional, result 
of the floating-point operation. The default action taken depends on the type of 
exception, and in the case of the Overflow exception, the current rounding mode. 

Invalid operation exception 

The invalid operation exception is signaled if one or both of the operands are in- 
valid for an implemented operation. The result, when the exception occurs with- 
out a trap, is a quiet NaN when the destination has a floating-point format, and is 
indeterminate if the result has a fixed-point format. The invalid operations are: 

1) Addition or subtraction: magnitude subtraction of infinities, such 
as(+l)-(-l) 
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2) Multiplication: times 1, with any signs d~ 

3) Division: over or 1 over 1, with any signs 

4) Square root: /"x, where x is less than zero 

5) Conversion of a floating-point number to a fixed-point format 
when an overflow, or operand value of infinity or NaN, pre- 
cludes a faithful representation in that format 

6) Comparison of predicates involving < or > without ?, when the 
operands are "unordered" 

7) Any operation on a signaling NaN. 

Software may simulate this exception for other operations that are invalid for the 
given source operands. Examples of these operations include IEE&-specified 
functions implemented in software, such as Remainder: x REM y, where y is 
zero or x is infinite; conversion of a floating-point number to a decimal format 
whose value causes and overflow or is infinity of NaN; and trancendental func- 
tions, such as In (-5) or cos ~~i3). 

Division-by-zero exception 

The division by zero exception is signaled on an implemented divide operation if 
the divisor is zero and the dividend is a finite nonzero number. The result, when 
no trap occurs, is a correctly signed infinity. 

If division by zero traps are enabled, the result register is not modified, and the 
source registers are preserved. 

Software may simulate this exception for other operations that produce a signed 
infinity, such as ln(0), sec(p/2), csc(O) or . 

Overflow exception 

The overflow exception is signaled when what would have been the magnitude of 
the rounded floating-point result, were the exponent range unbounded, is larger 
than the destination format's largest finite number. The result, when no trap oc- 
curs, is determined by the rounding mode and the sign of the intermediate result. 

If overflow traps are enabled, the result register is not modified, and the source 
registers are preserved. 

Underflow exception 

Two related events contribute to underflow. One is the creation of a tiny non- 
zero result between + or- 2 E min (minimum expressable exponent) which, be- 
cause it is tiny, may cause some other exception later. The other is extraordinary 
loss of accuracy during the approximation of such tiny numbers by denormalized 
numbers. 

The IEEE standard permits a choice in how these events are detected, but re- 
quires that they must be detected the same way for all operations. 
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The IEEE standard specifies that "tininess" may be detected either: "after round- 
ing" (when a nonzero result computed as though the exponent range were un- 
bounded would lie strictly between + or - 2 E min , r "before rounding" (when a 
nonzero result computed as though the exponent range and the precision were 
unbounded would lie strictly between + or- 2 E min . The architecture requires 
that tininess be detected after rounding. 

Loss of accuracy may be detected as either "denormalization loss" (when the de- 
livered result differs from what would have been computed if the exponent range 
were unbounded), or "inexact result" (when the delivered result differs from what 
would have been computed if the exponent range and precision were both un- 
bounded). The architecture requires that loss of accuracy be detected as inexact 
result. 

When an underflow trap is not enabled, underflow is signaled (via the underflow 
flag) only when both tininess and loss of accuracy have been detected. The de- 
livered result might be zero, denormalized, or + or - 2 Emin . When an under- 
flow trap is enabled, underflow is signaled when tininess is detected regardless of 
loss of accuracy. 

If underflow traps are enabled, the result register is not modified, and the source 
registers are preserved. 

Inexact exception 

If the rounded result of an operation is not exact or if it overflows without an 
overflow trap, then the inexact exception is signaled. The rounded or overflowed 
result is delivered to the destination register, when no inexact trap occurs. If in- 
exact exception traps are enabled, the result register is not modified, and the 
source registers are preserved. 

Unimplemented operation exception 

If an operation is specified that the hardware may not perform, due to an imple- 
mentation restriction on the supported operations or supported formats, an 
unimplemented operation exception may be signaled, which always causes a trap, 
for which there are no corresponding enable or flag bits. The trap cannot be dis- 
abled. 

This exception is raised at the execution of the unimplemented instruction. The 
instruction may be emulated in software, possibly using implemented floating- 
point unit instructions to accomplish the emulation. Normal instruction execu- 
tion may then be restarted. 

This exception is also raised when an attempt is made to execute an instruction 
with an operation code or format code which has been reserved for future archi- 
tectural definition. The unimplemented instruction trap is not optional, since the 
current definition contains codes of this kind. 

This exception may be signaled when unusual operands or result conditions are 
detected, for which the implemented hardware cannot properly handle the condi- 
tion. These may include (but are not limited to), denormalized operands or re- 
sults, NaN operands, trapped overflow or underflow conditions. The use of this 
exception for such conditions is optional. 
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Floating Point Rounding 



Bits and 1 of the coprocessor control register 31 sets the rounding mode for 
floating point. The machine allows four rounding modes: 

Round to nearest rounds the result to the nearest representable 
value. When the two nearest representable values are equally 
near, this mode rounds to the value with the least significant bit 
zero. To select this mode, set bits 1..0 of control register 31 to 0. 



c 



Round toward zero rounds toward zero. It rounds to the value 
that is closest to and not greater in magnitude than the infinitely 
precise result. To select this mode, set bits 1..0 of control regis- 
ter 31 to 1. 

Round toward positive infinity rounds to the value that is clos- 
est to and not less than the infinitely precise result. To select this 
mode, set bits 1..0 of control register 31 to 2. 

Round toward negative infinity rounds toward negative infin- 
ity. It rounds to the value that is closest to and not greater than 
the infinitely precise result. To select this mode, set bits 1..0 of 
control register 31 to 3. 

To set the rounding mode: 

/* setting the mounding mode */ 

RoundNearest = OxO 

RoundZero = Oxl 

RoundPosInf = 0x2 

RoundNeglnf = 0x3 

cfcl rt2, $31 # move from coprocessor 1 

and rt, Oxfffffffc # zero the round mode bits 
or rt, RoundZero # set mask as round to zero 
ctcl rt, $f31 # move to coprocessor 1 



( 



c 
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Introduction 



This chapter gives rules and examples to follow when designing an assembly lan- 
guage program. The chapter concludes with a 'learn by doing" technique that 
you can use if you still have any doubts about how a particular calling sequence 
should work. This involves writing a skeleton version of your prospective as- 
sembly routine using a high level language, and then compiling it with the -S 
option to generate a human-readable assembly language file. The assembly lan- 
guage file can then be used as the starting point for coding your routine. 



When you write assembly language routines, you should follow the same calling 
conventions that the compilers observe, for two reasons: 

• Often your code must interact with compiler-generated code, 
accepting and returning arguments or accessing shared global 
data. 

• The symbolic debugger gives better assistance in debugging pro- 
grams using standard calling conventions. 

The conventions for the compiler system are a bit more complicated than some, 
mostly to enhance the speed of each procedure call. Specifically: 

• The compilers use the full, general calling sequence only when 
necessary; where possible, they omit unneeded portions of it. 
For example, the compilers don't use a register as a frame 
pointer whenever possible. 

• The compilers and debugger observe certain implicit rules rather 
than communicating via instructions or data at execution time. 
For example, the debugger looks at information placed in the 
symbol table by a ".frame" directive at compilation time, so that 
it can tolerate the lack of a register containing a frame pointer at 
execution time. 



Program Design 



This section describes three general areas of concern to the assembly language 
programmer: 

• usable and restricted registers 

• stack frame requirements on entering and exiting a routine 

• the "shape" of data (scalars, arrays, records, sets) laid out by the 
various high level languages. 
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Register Use and Linkage 



The main processor has 32 32-bit integer registers. The uses and restrictions of 
these registers are described in Table LI in Chapter 1. 

The floating point coprocessor has 16 floating point registers. Each register can 
hold either a single precision (32 bit) or a double precision (64 bit) value. All 
references to these registers uses an even register number (e.g., $f4). Refer to 
Table 7.1 for details. 

Table 7 J Floating Point Registers 



c 



Floating Point Registers 


register name 


use and linkage 


$K)..f3 


used to hold floating point type function 




results ($f0) and complex type function 




results ($f0 has the real part, $12 has 




the imaginary part) 


$f4..fl0 


temporary registers, used for expression 




evaluation, whose values are not pre- 




served across procedure calls. 


$fl2..$fl4 


used to pass the first 2 single or double 




precision actual arguments, whose values 




are not preserved across procedure calls. 


$fl6..$fl8 


temporary registers, used for expression 




evaluations, whose values are not pre- 




served across procedure calls. 


$f20,.$f30 


saved registers, whose values must be 




preserved across procedure calls. 



( 



The Stack Frame 



The compilers classify each routine into one of of the following categories: 

• non-leaf routines, that is, routines that call other procedures 

• leaf routines, that is, routines that do not themselves execute any 
procedure calls. Leaf routines are of two types: 

o leaf routines that require stack storage for local variables 

o leaf routines that do not require stack storage for local vari- 
ables, 
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You must decide the routine category before determining the calling sequence. 

To write a program with proper stack frame usage and debugging capabilities, 
use the following procedure: 

1.. Regardless of the type of routine, you should include a .ent pseudo-op and 
an entry label for the procedure. The ,ent pseudo-op is for use by the debug- 
ger, and the entry label is the procedure name. The syntax is: 

ent procedure_name 

procedure_name : 

2. If you are writing a leaf procedure that does not use the stack, skip to step 3. 
For leaf procedure that uses the stack or non-leaf procedures, you must allo- 
cate all the stack space that the routine requires. The syntax to adjust the 
stack size is: 

subu $sp, f ramesize 

where framesize is the size of frame required. Space must be allocated for: 

• local variables 

• saved general registers. Space should be allocated only for those 
registers saved. For non-leaf procedures, you must save $31, 
which is used in the calls to other procedures from this routine. 
If you use registers $16-$23, you must also save them. 

• saved floating point registers. Space should be allocated only for 
those registers saved. If you use registers $f20-$f30 you must 
also save them. 

• Procedure call argument area. You must allocate the maximum 
number of bytes for arguments of any procedure that you call 
from this routine. 

NOTE: Once you have modified $sp, you should not modify it again for the rest 
of the routine. 

3. Now include a .frame pseudo-op: 

. frame f ramereg, f ramesize, returnreg 

The virtual frame pointer is a frame pointer as used in other compiler systems but 
has no register allocated for it. It consists of the framereg ($sp, in most cases) 
added to the framesize (see step 2 above). Figure 7.1 illustrates the stack compo- 
nents. 
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high memory 



virtual frame 
pointer ($fp) 



frame 
offset 



\ 



stack 

pointer ($sp) 

(framereg) 



low memory 



argument n 



argument 1 



local & temporaries 



saved registers 

(including retumreg ) 



argument build 



framesize 



Figure 7 J Stack Organization 



The retumreg specifies the register the return address is in (usually $31). These 
usual values may change if you use a varying stack pointer or are specifying a 
kernel trap routine, 

4. If the procedure is a leaf procedure that does not use the stack, skip to step 7. 
Otherwise you must save the registers you allocated space for in step 2. 

To save the general registers, use the following operations: 



.mask 

sw 



bitmask, f rameof f set 

reg, framesize+f rameof fset-N ($sp) 



The .mask directive specifies the registers to be stored and where they are stored. 
A bit should be on in hitmask for each register saved (for example, if register $31 
is saved, bit 31 should be T in bitmask. Bits are set in bitmask in little-endian 
order, even if the machine configuration is big-endian). The frameoffset is the 
offset from the virtual frame pointer (this number is usually negative). N should 
be for the highest numbered register saved and then incremented by four for 
each subsequently lower numbered register saved. For example: 



sw 
sw 
sw 



$31, framesize+f rameof f set ($sp) 
$17, framesize+f rameof fset-4 ($sp) 
$16, framesize+f rameof fset-8 ($sp) 



( 



( 



Figure 7.2 illustrates this example. 



( 



7-4 



Assembly Language Programmer's Guide 



Unking Conventions 



high memory 



virtual frame 
pointer ($fp) 



frame 
offset 



\ 



stack 

pointer ($sp) 



low memory 



saved $31 



saved $17 
saved $16 



1 



framesize 



Figure 7.2 Stack Example 



Now save any floating point registers that you allocated space for in step 2 as 
follows: 



. fmask 
s . [sd] 
fset-N($sp) 



bitmaskv f rameof f set 
reg, framesize+f rameof - 



Notice that saving floating point registers is identical to saving general regis- 
ters except we use the .fmask pseudo-op instead of .mask, and the stores are 
of floating point singles or doubles. The discussion regarding saving general 
registers applies here as well, but remember that N should be incremented by 
8 for doubles. 

This step describes parameter passing: how to access arguments passed into 
your routine and passing arguments correctly to other procedures. For infor- 
mation on high-level language specific constructs (call-by-name, call-by- 
value, string or structure passing), refer to Chapter 3 of the Language Pro- 
grammer's Guide. 

As specified in step 2, space must be allocated on the stack for all arguments 
even though they may be passed in registers. This provides a saving area if 
their registers are needed for other variables. 

General registers $4-$7 and float registers $fl2, $fl4 must be used for pass- 
ing the first four arguments (if possible). You must allocate a pair of regis- 
ters (even if it's a single precision argument) that start with an even register 
for floating point arguments appearing in registers. 



Assembly Language Programmer's Guide 



7-5 



Chapter 7 



In the table below, the W arguments are considered single and double pre- 
cision floating point arguments, and 'nN' arguments are everything else. The 
ellipses (...) mean that the rest of the arguments do not go in registers regard- 
less of their type. The 'stack' assignment means that you do not put this ar- 
gument in a register. The register assignments occur in the order shown in 
order to satisfy optimizing compiler protocols. 



c 



Arguments 


Register Assignments 


(fl, O, ...) 
(fl.nl, £2,...) 
(fl,nl,n2,...) 


fl->$fl2,f2->$fl4 

fl -> $fl2, nl -> $6, f2 -> stack 

fl -> $fl2, nl -> $6, n2 -> $7 


(nl,n2,n3,n4, ...) 
(nl,n2,n3,fl,...) 
(nl.n2.fl,...) 
(nl, fl, ...) 


nl -> $4, n2 -> $5, n3 -> $6, n4 -> $7 
nl -> $4, n2 -> $5, n3 -> $6, fl -> stack 
nl -> $4, n2 -> $5, fl -> ($6, $6) 
nl -> $4, fl -> ($6, $7) 



6. Next, you must restore registers that were saved in step 4. To restore general 
purpose registers: 

lw reg, f ramesize+f rameof f set-N ($sp) 
To restore the floating point registers: 

1. [sd] reg, framesize+frameoff set-N($sp) 
(Refer to step 4 for a discussion of the value of//.) 

7. Get the return address: 

lw $31, framesize+frameof f set ($sp) 

8. Clean up the stack: 

addu $sp, frames ize 

9. Return: 

j $31 

10. To end the procedure: 

.end procedurename 
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The Shape of Data 



Examples 



In most cases, high-level language routine and assembly routines communicate 
via simple variables: pointers, integers, booleans, and single- and double-preci- 
sion real numbers. Describing the details of the various high-level data struc- 
tures (arrays, records, sets, and so on) is beyond our scope here. If you need to 
access such a structure as an argument or as a shared global variable, refer to 
Chapter 3 of the Language Programmer's Guide, and the "Learn by Doing" 
technique described at the end of this section. 



This section contains the examples that illustrate program design rules; each ex- 
ample shows a procedure written and C and its equivalent written in assembly 
language. 

Figure 7.3 shows a non-leaf procedure. Notice that it creates a stackframe, and 
also saves its return address since it must put a new return address into register 
$31 when it invokes its callee: 
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float 
nonleaf (i, j) 

int i, *j; 

{ 

double atof () ; 

,int temp; 

temp = i - *j; 

if (i < *j) temp = -temp; 

return atof (temp) ; 

} 



# 
# 
# 
# 


1 

2 
3 
4 


.globl 
float 
nonleaf (i, j) 

int i, *j; 

{ 


nonleaf 




Leaf " 


. ent 


nonleaf 2 


# 
# 


5 
6 


subu 
sw 

.mask 
. frame 

double atof () 

int temp; 


$sp, 24 
$31, 20($sp) 
0x80000000, -4 
$sp, 24, $31 



# 7 

# 8 



$32: 
# 10 



temp = i - *j; 

lw $2, 0($5) 

subu $3, $4, $2 

if (i < *j) temp = -temp; 

bge $4, $2, $32 

negu $3, $3 

return atof (temp) ; 



move 


$4, $3 


jal 


atof 


cvt . s . d 


$f0, $f0 


lw 


$31, 20($sp) 


addu 


$sp, 24 


3 


$31 


.end 


nonleaf 



## Create stackframe 

## Save the return address 



## Arguments are in $4 and $5 



## Note: $32 is a label, not a register 



## Return value goes in $f 
## Restore return address 
## Delete stackframe 
## Return to caller 



C 



Figure 7 J. Non-Leaf Procedure 

Figure 7.4 shows a leaf procedure that does not require stack space for local vari- 
ables. Notice that it creates no stackframe, and saves no return address: 
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int 










leaf (pi, 


p2) 








int pl ; 

{ 
return 

} 


p2; 








(Pi 


> p2) ? pi 


: p2; 








.globl 




leaf 


# 1 




int 






# 2 




leaf (pi, p2) 




# 3 




int pi, 


p2; 




# 4 




{ 
.ent 




leaf 2 


leaf: 




. frame 




$sp, 0, $31 


# 5 




return 
ble 
move 
b 


(pl > 


p2) ? pl : p2; 

$4, $5, $32 ## Arguments in $4 and $5 

$3, $4 

$33 


$32: 




move 




$3, $5 


$33: 




move 
j 




$2, $3 ## Return value goes in $2 
#31 ## Return to caller 


# $ 




} 






1 




.end leaf 





Figure 7.4. Leaf Procedure Without Stack Space for Local Variables 

Figure 7.5 shows a leaf procedure that requires stack space for local variables. 
Notice that it creates a stack frame, but does not save a return address. 
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char 
leaf_storage (i) 

int i ; 

{ 

char a [16] ; 

int j; 

for (j = 0; j < 10; j++) 



a[j] - '0' 
for (j = 10; 

a[j] = 'a' 
return a [ i ] ; 
} 



16; j++) 



leaf_storage: 



$32: 
# 



# 10 



$33: 
# 



11 



# 12 



.globl 

char 

leaf__storage (i 

int i; 

{ 
.ent 

subu 
. frame 

char a [16] ; 

int j; 



leaf_storage 



leaf storage 2 # * " 2 " is the lexical l evel of the 
— ## procedure o You may omit it. 



$sp, 24 
$sp, 24, $31 



## Create stackframe 



for 
sw 
addu 



(j - 0; 



a[j] 
lw 

addu 
addu 
sb 
lw 

addu 
sw 
bit 

for 
li 
sw 



(j - 10; 



a[j] = 
lw 

addu 
addu 
sb 
lw 

addu 
sw 
bit 

return 
addu 
lbu 
addu 

j 
-end 



'a' 



a[i] 



j < 10; j++) 
$0, 4($sp) 
$3, $sp, 24 

+ j; 

. $14 f 4($sp) 
$15, $14, 48 
$24, $3, $14 
$15, -16($24). 
$25, 4($sp) 
$8, $25, 1 
$8, 4($sp) 
$8, 10, $32 

j < 16; j++) 
$9, 10 
$9, 4($sp) 

+ j; 

$10, 4($sp) 

$11, $10, 97 

$12, $3, $10 

$11, -16($12) 

$13, 4($sp) 

$14, $13, 1 

$14, 4($sp) 

$14, 16, $33 

$15, $3, $4 
$2, -16($15) 
$sp, 24 
$31 
leaf_storage 



## Argument is in $4 

## Return value goes in $2 

## Delete stackframe 

## Return to caller 



C 



( 



Figure 7.5. Leaf Procedure With Stack Space for Local Variables 
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Learning by Doing 



The rules and parameter requirements required between assembly language and 
other languages are varied and complex. The simplest approach to coding an 
interface between an assembly routine and a routine written in a high-level lan- 
guage is to do the following: 

• Use the high-level language to write a skeletal version of the 
routine that you plan to code in assembly language. 

• Compile the program using the — S option, which creates an as- 
sembly language (.s) version of the compiled source file. 

• Study the assembly-language listing and then, imitating the rules 
and conventions used by the compiler, write your assembly lan- 
guage code. 

The next two sections illustrate techniques to use in creating an interface between 
assembly language and high-level language routines. The examples shown are 
merely to illustrate what to look for in creating your interface. Details such as 
register numbers will vary according to the number, order, and data types of the 
arguments. You should write and compile realistic examples of your own code 
in writing your particular interface. 



Calling a High-Level Language Routine 



The following steps show a technique to follow in writing an assembly language 
routine that calls atof, a routine written in C that converts ASCII characters to 
numbers; for more information, see the atof(3) in the UNIX Programmer' $ Man- 
ual. 

1 . Write a C program that calls atof. Pass global rather than local variables; this 
makes them to recognize in the assembly language version of the C program, 
(and ensures that optimization doesn't remove any of the code on the 
grounds that it has no effect.) 

Below is an example of a C program that calls atof. 



char c[] : 
double d, 
float f; 
caller () 

{ 

d = atof (c) 

f = (float) 

} 



= "3.1415'' 
atof () ; 



c is declared as a 
global variable. 



atof (c) 



2. Compile the program using the using the compiler options shown below: 

cc -S -O caller. c 

The — S option causes the compiler to produce the assembly-language list- 
ing; the — O option, though not required, reduces the amount of code gener- 
ated, making the listing easier to read. 
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3. After compilation, look at the file caller.s (shown below). The highlighted 
section of the listing shows how the parameters are passed, the execution of 
the call, and how the returned values are retrieved. 



c 







.<g\lol>l 


^^^^^^^H 






q; 




III : % ;|j:i|;|j|||: j|||| | 










<word 


^^^^^H 


Ill 








<word 


13«17 : !• 










lillliliiiiliillllillliill 


d 8 










. <COJTO 


t 4 










lllililllllil 


illlll 










.text . 












Ill^i^llliiiiiiiiiii 


caller 2 






calle-r : 














|!ii|^ill|||illl|;:| 


$sp, 24 










|:|:|i|||||||:|||||i 


llilfi?2l;;|isil; 










IIIIjII 


oxeooooooo, 


III 








+ f r^me 


$sp, 24 r $31 






11111 


iiii 


• char 


Q£3 - "3*1413"; 




ill! 


ill 


double d/- ato£0 ; 






llil 


ill 


float f; 






iilll 


WM 


caller <) , 






# 


llil 


■■■liiil 








1111 


ill 


d » ata£{C) ; 










la 


$4 f c 


## 


load address of c 






jal 


atof 


## 


call atof 






s.d 


$f0, d 


## 


store result in d 


# 


7 














la 


$4 r c 


## 


load address of c 






jal 


atof 


## 


call atof 






cvt .s.d 


$f4 f $f0 


## 


convert double result to float |l 






s.s ' 


$£4, f 


II 


3t0^ float r^ult iti f ' 






Iw ' 


$32, 20{$ap) 










'lliliilllllllll 


$5p f 24 










|[!ji|||||||||||l!|!!|!i 


$31 










♦end ' 


l;illlllll;lllli? 







c 



Calling an Assembly Language Routine 



This section shows a technique to follow in writing an assembly language routine 
that calls a routine written in a high-level language (Pascal is used in this exam- 
ple). 

1 . Write a facsimile of the assembly language routine you wish to call. In the 
body of the routine, write statements that use the same arguments you intend 
to use in the final assembly language routine. Copy the arguments to global 
variables rather than local variables to make it easy for you to read the result- 
ing assembly language listing. 

Below is the Pascal facsimile of the assembly language program. 
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type 

str = packed array [1 . . 10] of char; 
subr =2 . . 5; 






var 

global__r: real; 
global_c: subr; 
global_s: str; 
global__b: boolean; 






function callee (var r: real; c: subr; s: 
begin 

global__r := r; 
global_c :== c; 
global__s := s; 
callee := c = 3; 
end; 


str) : 


boolean; 



2. Compile the program using the using the compiler options shown below: 

cc — S —0 caller. c 

The — S option causes the compiler to produce the assembly-language list- 
ing; the — O option, though not required, reduces the amount of code gener- 
ated, making the listing easier to read. 

3. After compilation, look at the file caller.s (shown below). The highlighted 
section of the listing shows how the parameters are passed, the execution of 
the call, and how the returned values are retrieved. 
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*go» 



lil^BI8B8 



c 



lllllli^ 

llllllllw^ 



lbu 



lliM 



JiiuJv(iS£L 



lilillMillllilii 



£"2wiv&iwwK ; ! 






llll 



$32 : 



la 

addu 
. set 
addu 

lbu 

addu 

sb 

lbu 

addu 

sb 

lilliil 



31 



mil 



1111111 



and 
111 



$14, global__s 
$15, $sp, 8 

noat 
$24, $15, 10 

$1, 0($15) 
$15, $15, 2 
$1, 0($14) 
$1, -1($15) 
$14, $14, 2 

^13^ $24, $^ 

illlll^P81S8illi 
liillllliillllll 



.JtfL.§$t....$yk.£an^ 



Jj.JThe pointer to "rZ>J*£>:M~MJML 



## For array "s", the caller gives you a 
## pointer at 8 ($sp) . If you want to use 
## it as a call-by-value argument just as 
## Pascal does (that is, if you want to 
## be able to modify a local copy without 
## affecting the global copy) then you 
## must copy it into your stack frame as 
## shown here (the code enclosed in ".set 
## noat" is a tight byte-copying loop) . 
## Otherwise, you may simply use the 
## pointer ;.. provided ..to. .you.. 



( 



# # Re t u r n the., boo 1 e an by leaving i t . in $ 2 



( 
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This chapter describes pseudo op-codes (directives). These pseudo op-codes 
influence the assembler's later behavior. In the text, boldface type specifies a 
keyword and italics represents an operand that you define. 

The assembler has these pseudo op-codes: 



Pseudo-Op 



Description 



.aent name, symno 

.alias regl, reg2 
.alias gpreg,gpreg 

.align expression 



.ascii string [, string]. 



asciiz string [, string]. 



.asmO 



Sets an alternate entry point for the current procedure. 
Use this information when you want to generate infor- 
mation for the debugger. It must appear inside an 
.ent/.end pair. 

Registerl and register2, when used as indexed registers 
to memory may point to same part of memory. 

Indicates that memory reference through the two regis- 
ters will never overlap. The compiler uses this form to 
improve instruction scheduling. 

Advance the location counter to make the expression 
low order bits of the counter zero. 

Normally, the .half, .word, .float, and .double directives 
automatically align their data appropriately. For exam- 
ple, .word does an implicit .align 2 (.double does a 
.align 3). You disable the automatic alignment feature 
with .align 0. The assembler reinstates automatic align- 
ment at the next .text, .data, .rdata, or .sdata directive. 

Labels immediately preceding an automatic or explicit 
alignment are also realigned. For example, foo: .align 
3; .word is the same as .align 3; foo: .wordO . 

Assembles each string from the list into successive loca- 
tions. The .ascii directive does not null pad the string. 
You MUST put quotation marks (") around each string. 
You can use the backslash escape characters. For a list 
of the backslash characters, see Chapter 4. 

Assembles each string in the list into successive loca- 
tions and adds a null. You can use the backslash escape 
characters. For a list of the backslash characters, see 
Chapter 4. 

Tells the assembler's second pass that this assembly 
came from the first pass. (For use by compilers.) 
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Pseudo-Op 



Description 



.bgnb symno 



(For use by compilers.) Sets the beginning of a language 
block. The .bgnb and .endb directives delimit the scope 
of a variable set. The scope can be an entire procedure, 
or it can be a nested scope (for example a "{ } " block in 
the C language). The symbol number symno refers to a 
dense number in a .T file. For an explanation of .T files, 
see the Languages Programmer's Guide. To set the end 
of a language block, see .endb. 



.byte expression! [, expression! y... 



/, expressionN] 



xomm name, expression 



.data 



Truncates the expressions from the comma-separated 
list to 8-bit values, and assembles the values in succes- 
sive locations. The expressions must be absolute. The 
operands can optionally have the form: expression! [ : 
expression! ]. The expression! replicates expression!' $ 
value expression! times. 

Unless defined elsewhere, name becomes a global com- 
mon symbol at the head of a block of expression bytes of 
storage. The linker overlays like-named common 
blocks, using the maximum of the expressions. 

Tells the assembler to add all subsequent data to the 
data section. 



.double expression [ , expression! ] ... 

/, expressionN Initializes memory to 64-bit floating point numbers. 

The operands can optionally have the form: expression! 
[ : expression! ]. The expression! is the floating point 
value. The optional expression! is a non-negative ex- 
pression that specifies a repetition count. The expres- 
sion! replicates expression! 's value expression! times. 
This directive automatically aligns its data and any pre- 
ceding labels to a double-word boundary. You can dis- 
able this feature by using .align 0. 



.end [procjiame] 



.endb symno 



Sets the end of a procedure. Use this directive when you 
want to generate information for the debugger. To set 
the beginning of a procedure, see .ent. 

Sets the end of a language block. To set the beginning 
of a language block, see .bgnb. 



c 



c 
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Pseudo-Op 



Description 



.endr 

.ent procjtame 



Signals the end of a repeat block. To start a repeat 
block, see .repeat. 

Sets the beginning of the procedure procjtame. Use 
this directive when you want to generate information for 
the debugger. To set the end of a procedure, see .end. 



.extern name expression 



.err 



name is a global undefined symbol whose size is as- 
sumed to be expression bytes. The advantage of using 
this directive, instead of permitting an undefined symbol 
to become global by default, is that the assembler can 
decide whether to use the economical $gp-relative ad- 
dressing mode, depending on the value of the -G option. 
As a special case, if expression is zero, the assembler 
refrains from using $gp to address this symbol regard- 
less of the size specified by -G. 

Signals an error. Any compiler front-end that detects an 
error condition puts this directive in the input stream. 
When the assembler encounters a .err, it quietly ceases 
to assemble the source file. This prevents the assembler 
from continuing to process a program that is incorrect. 
(For use by compilers.) 

Specifies the source file corresponding to the assembly 
instructions that follow. For use only by compilers, not 
by programmers; when the assembler sees this, it re- 
frains from generating line numbers for dbx to use un- 
less it also sees .loc directives. 

.float expression! [ , expression! ]... [, expressionN ] 

Initializes memory to single precision 32-bit floating 
point numbers. The operands can optionally have the 
form: expression! [ : expression! ]. The optional ex- 
pression! is a non-negative expression that specifies a 
repetition count. This optional form replicates expres- 
sion!' $ value expression! times. This directive auto- 
matically aligns its data and preceding labels to a word 
boundary. You can disable this feature by using .align 
0. 



.file file jiumber 
file_name_string 
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Pseudo-Op 



Description 



.frame frame-register offset 
return j>cjegister 



.fmask mask offset Sets a mask with a bit turned on for each floating point 

register that the current routine saved. The least-signifi- 
cant bit corresponds to register $f0. The offset is the 
distance in bytes from the virtual frame pointer at which 
the floating point registers are saved. The assembler 
saves higher register numbers closer to the virtual frame 
pointer. You must use .ent before .fmask and only one 
.fmask may be used per .ent. Space should be allocated 
for those registers specified in the .fmask. 

Describes a stack frame. The first register is the frame- 
register, the offset is the distance from the frame register 
to the virtual frame pointer, and the second register is the 
return program counter (or, if the first register is $0, this 
directive shows that the return program counter is saved 
four bytes from the virtual frame pointer). You must use 
.ent before .frame and only one .frame may be used per 
.ent. No stack traces can be done in the debugger with- 
out .frame. 

Makes the name external. If the name is otherwise de- 
fined (by its appearance as a label), the assembler will 
export the symbol; otherwise it will import the symbol. 
In general, the assembler imports undefined symbols 
(that is, it gives them the UNIX storage class "global 
undefined" and requires the linker to resolve them). 

.half expression! [ , expression! ] ... [ , expressionN] 

Truncates the expressions in the comma-separated list to 
16-bit values and assembles the values in successive 
locations. The expressions must be absolute. This di- 
rective can optionally have the form: expression! [ : ex- 
pression! ]. The expression! replicates expression!^ 
value expression! times. This directive automatically 
aligns its data appropriately. You can disable this fea- 
ture by using .align 0. 



.globl name 



Jab label name 



Associates a named label with the current location in the 
program text. (For use by compilers). 



c 



c 



( 
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Pseudo-Op 



.lcomm name, expression 



Aocfilejiumber 
line number 



.mask mask, offset 



Description 



Makes the name's data type bss. The assembler allo- 
cates the named symbol to the bss area, and the expres- 
sion defines the named symbol's length. If a .globl di- 
rective also specifies the name, the assembler allocates 
the named symbol to external bss. The assembler puts 
bss symbols in one of two bss areas. If the defined size 
is smaller than the size specified by the assembler or 
compiler's -G command line option, the assembler puts 
the symbols in the sbss area and uses $gp to address the 
data. 

Specifies the source file and the line within that file that 
corresponds to the assembly instructions that follow. 
The assembler ignores the file number when this direc- 
tive appears in the assembly source file. Then, the as- 
sembler assumes that the directive refers to the most re- 
cent .file directive. When a Joe directive appears in the 
binary assembly language .G file, the file number is a 
dense number pointing at a file symbol in the symbol 
table .T file. For more information about .G and .T 
files, see the Languages Programmer's Guide. (For use 
by compilers). 

Sets a mask with a bit turned on for each general pur- 
pose register that the current routine saved. Bit one cor- 
responds to register $1. The offset is the distance in 
bytes from the virtual frame pointer where the registers 
are saved. The assembler saves higher register numbers 
closer to the the virtual frame pointer. Space should be 
allocated for those registers appearing in the mask. If bit 
zero is set it is assumed that space is allocated for all 31 
registers regardless of whether they appear in the mask. 
(For use by compilers). 
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Description 



noalias regl,reg2 



nop 



•option options 



.repeat expression 



.rdata 
.sdata 
.set option 



Registerl and register2, when used as indexed registers 
to memory will never point to the same memory. The 
assembler will use this as a hint to make more liberal 
assumptions about resource dependency in the program. 

Tells the assembler to put in an instruction that has no 
effect on the machine state. While several instructions 
cause no-operation, the assembler only considers the 
ones generated by the nop directive to be wait instruc- 
tions. This directive puts an explicit delay in the instruc- 
tion stream. 

NOTE: Unless you use ".set noreorder", the reorganizer 
may eliminate unnecessary "nop" instructions. 

Tells the assembler that certain options were in effect 
during compilation. (These options can, for example, 
limit the assembler's freedom to perform branch optim- 
izations.) This option is intended for compiler-generated 
.s files rather than for hand-coded ones. 

Repeats all instructions or data between the .repeat di- 
rective and the .endr directive. The expression defines 
how many times the data repeats. With the .repeat di- 
rective, you CANNOT use labels, branch instructions, or 
values that require relocation in the block. To end a .re- 
peat, see .endr. 

Tells the assembler to add subsequent data into the 
rdata section. 

Tells the assembler to add subsequent data to the sdata 
section. 

Instructs the assembler to enable or to disable certain 
options. Use set options only for hand-crafted assembly 
routines. The assembler has these default options: reor- 
der, macro, and at. You can specify only one option for 
each .set directive. You can specify these .set options: 

The reorder option lets the assembler reorder machine 
language instructions to improve performance. 

The noreorder option prevents the assembler from reor- 
dering machine language instructions. If a machine lan- 
guage instruction violates the hardware pipeline con- 
straints, the assembler issues a warning message. 



c 



c 
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Pseudo-Op 



Description 



.space expression 



.struct expression 



The bopt/nobopt option lets the assembler perform 
branch optimization. This involves moving an instruc- 
tion that is the target of a branch or jump instruction into 
the delay slot; this is perofrmed only if no unpredictable 
side effects can occur. 

The macro option lets the assembler generate multiple 
machine instructions from a single assembler instruction. 

The nomacro option causes the assembler to print a 
warning whenever an assembler operation generates 
more than one machine language instruction. You must 
select the noreorder option before using the nomacro 
option; otherwise, an error results. 

The at option lets the assembler use the $at register for 
macros, but generates warnings if the source program 
uses $at. 

When you use the noat option and an assembler opera- 
tion requires the $at register, the assembler issues a 
warning message; however, the noat option does let 
source programs use $at without issuing warnings. 

The nomove options tells the assembler to mark each 
subsequent instruction so that it cannot be moved during 
reorganization. Because the assembler can still insert 
nop instructions where necessary for pipeline con- 
straints, this option is less stringent than noreorder. 
The assembler can still move instructions from below 
the nomove region to fill delay slots above the region or 
vice versa. The nomove option has part of the effect of 
the "volatile" C declaration; it prevents otherwise inde- 
pendent loads or stores from occurring in a different or- 
der than intended. 

The move option cancels the effect of nomove. 

Advances the location counter by the value of the speci- 
fied expression bytes. The assembler fills the space with 
zeros. 

This permits you to lay out a structure using labels plus 
directives like .word, .byte, and so forth. It ends at the 
next segment directive (.data, .text, etc.). It does not 
emit any code or data, but defines the labels within it to 
have values which are the sum of expression plus their 
offsets from the .struct itself. 
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Description 



(symbolic equate) Takes one of these forms: name = expression or name = 

register. You must define the name only once in the as- 
sembly, and you CANNOT redefine the name. The ex- 
pression must be computable when you assemble the 
program, and the expression must involve operators, 
constants, and equated symbols. You can use the name 
as a constant in any later statement. 

Tells the assembler to add subsequent code to the text 
section. (This is the default) 

Specifies the major and minor version numbers (for ex- 
ample, version 0. 15 would be .verstamp 15). 

(For use by compilers). Describes a register variable by 
giving the offset from the virtual frame pointer and the 
symbol number symno (the dense number) of the sur- 
rounding procedure. 

.word expression! [, expression! ] ... [ , expressionN] 

Truncates the expressions in the comma-separated list to 
32— bits and assembles the values in successive locations. 
The expressions must be absolute. The operands can 
optionally have the form: expression! [ : expression!]. 
The expression! replicates expression!'® value expres- 
sion! times. This directive automatically aligns its data 
and preceding labels to a word boundary. You can dis- 
able this feature by using .align 0. 



.text 



.verstamp major minor 



.vreg register offset symno 



c 



( 
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This chapter provides information on the object file format and has the following 
major topics: 

• An overview of the components that make up the object file, and 
the differences between the object-file format and the UNIX 
System V common object file format (COFF). 

• A description of the headers and sections of the object file. De- 
tailed information is given on the logic followed by the assem- 
bler and link editor in handling relocation entries. 

• The format of object files (OMAGIC, NMAGIC, ZMAGIC, and 
LIBMAGIC), and information used by the system loader in load- 
ing object files at run-time. 

• Archive files and link editor defined symbols. 

Overview 

The assembler and the link editor generate object files in the order shown in the 
figure on the next page. Any areas empty of data are omitted, except that the File 
Header, Optional Header, and Section Header are always present. 

The fields of the Symbol table portion (indicated in figure on the next page) that 
appear in the final object file format vary, as follows: 

• The Line Numbers, Optimization Symbols, and Auxiliary Sym- 
bols tables appear only when debugging is on (when the user 
specifies one of the compiler — gl, — g2 or — g3 options ). 

• When the user specifies the — x option (strip non-globals) for 
the link edit phase, the link editor strips the Line Number, Local 
Symbols, Optimization Symbols, Auxiliary Symbols, Local 
Strings, and Relative File Descriptor tables from the object file, 
and updates the Procedure Descriptor table. 

• The link editor strips the entire Symbol table from the object file 
when the user specifies the — s option (strip) for the link edit 
phase. 

Any new assembler or link editor designed to work with the compiler system 
should lay out the object file in the order shown in the figure on the next page. 
The link editor can process object files that are ordered differently, but perform- 
ance may be degraded. 
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File Header 



Optional Headers 



Section Headers 



Section Data 



text 

initialization text 
read-only data 
large data 
8-byte literal pool 
4-byte literal pool 



small data 
small bss (0 size) 
large bss (0 size) 
shared library info, 
ucode (ucode ob- 
jects only) 



Section Relocation Information 



text 
read-only data 



large data 
small data 



Symbolic Header 



i; i>M^^ 






Procedure Descriptor Table 



' LocalSymboh 



$$^^ 



'^^^^^^^^ 



&£*^ 



External Strings 



File Descriptor 



\\\\\\\ \\\\\\\v 



External Symbols 



* Created only if debugging is ON. 
S3 Missing if stripped of non-globals. 



c 



( 



Symbol Table. Missing 
if fully stripped. 



Figure 9 J. Object File Format 
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Readers already familiar with standard UNIX System V COFF (common object 
file format) may be interested in the difference between it and the compiler sys- 
tem format, as described next. 

The compiler system File Header definition is based on UNIX System V header 
file filehdr.h with the following modifications. 

• The symbol table file pointer and the number of symbol table 
entries now specify the file pointer and the size of the Symbolic 
Header respectively (described in Chapter 10). 

• All tables that specify symbolic information have their file point- 
ers and number of entries in this Symbolic Header. 

The Optional Header definition has the same format as the UNIX System V 
header file aouthdr.h (the standard UNIX system a.out header) except the follow- 
ing fields have been added: bss_start, gprmask, cprmask, and gp_yalue. See Ta- 
ble 9.4. 

The Section Header definition has the same format as the UNIX System V's 
header file scnhdr.h. except the line number fields are used for global pointers. 
See Table 9.6. 

The relocation information definition is similar to Berkeley UNIX 4.3 BSD, 
which has 'local" relocation types; however, you should read the topic Section 
Relocation Information in this chapter to be aware of differences that do exist. 

For further information on UNIX System V COFF format, refer to the UNIX Sys- 
tem V Support Tools Guide. 



The File Header 



Table 9.1 shows the format of the File Header; the header file filehdr.h contains 
its definition. 

Table 9.1 File Header Format 



Declaration 


Field 


Description 


unsigned short 


fjnagic; 


magic number 


unsigned short 


fnscns; 


number of sections 


long 


fjimdat; 


time and date stamp 


long 


f_symptr; 


file pointer to symbolic h 


long 


fjisyms; 


size of symbolic header 


unsigned short 


f_opthdr; 


size of optional header 


unsigned short 


f_flags; 


flags 



The fjymptr points to the Symbolic Header of the Symbol table, and the fjisyms 
gives the size of the header. For a description of the Symbolic Header, see 
Chapter 9. Other fields in the File Header are described in the sections that fol- 
low. 
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File Header Magic Field (fjnagic) 



The magic number in the fjnagic entry in the File Header specifies the target 
machine on which an object file can execute. Table 9.2 shows the values and 
mnemonics for the magic numbers; the header file filehdr.h contain the preproc- 
essor macro definitions. 

Table 9.2 File Header Magic Numbers 



c 



Symbol 



Value 



Description 



Flags (Mlags) 



MPSEBMAGIC 0x0160 

MIPSELMAGIC 0x0162 

SMPSEBMAGIC 0x6001 

SMPSELMAGIC 0x6201 



MIPSEBUMAGIC 0x0180 
MIPSELUMAGIC 0x0182 



big-endian target (headers and tables have same 

byte sex as host machine.) 

little-endian target (headers and 

tables have same byte sex as host machine.) 

big-endian target (headers and tables 

have opposite byte sex as host machine.) 

little-endian target (headers and 

tables have opposite byte sex as host 

machine) 

MIPS big-endian ucode object file 

MIPS little-endian ucode object file 



Tht fjlags field describes the object file characteristics. Table 9.3 describes the 
flags and gives their hexadecimal bit patterns. The table notes those flags that 
don't apply to compiler system object files. 



c 
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Table 93 File Header Flags 



Symbol Value 



Description 



F_RELFLG 
F_EXEC 

FLNNO 

FLSYMS 

F_MINMAL 

FJJPDATE 

F_SWAB 

F AR16WR 



F_AR32WR 

F_AR32W 

F_PATCH 
F NODF 



0x0001 relocation information stripped from file 

0x0002 file is executable (i.e. no unresolved 

external references) 

0x0004 line numbers stripped from file 

0x0008 local symbols stripped from file 

0x0010 Iminimal object file (".m") output of fextract 

0x0020 ! fully bound update file, output of ogen 

0x0040 !file whose bytes were swabbed (in names) 

0x0080 !file has the byte ordering of an AR16WR 

(e.g. 1 1/70) machine (it was created there, or was 

produced by conv) 

0x0100 !file has the byte ordering of an AR32WR machine 

(e.g. vax) 

0x0200 !file has the byte ordering of an AR32W machine 

(e.g. 3b,maxi,MC68000) 

0x0400 !file contains "patch" list in Optional Header 

0x0400 ! (minimal file only) no decision functions for 

replaced functions. 



!Not used by compiler system object modules. 



Optional Header 



The link editor and the assembler fill in the Optional Header, and the system 
(kernel) loader (or other program that loads the object module at run-time) uses 
the information it contains, as described in the section Loading Object Files in 
this chapter. 

Table 9.4 shows the format of the Optional Header; the header file aouthdr.h 
contains its definition. 
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Table 9.4 Optional Header Definition 



c 



Declaration 


Field 


Description 


short 


magic; 


See Table 9.5. 


short 


vstamp; 


version stamp 


long 


tsize; 


text size in bytes, padded to 16-byte 
boundary 


long 


dsize; 


initialized data in bytes, padded to 
16-byte boundary 


long 


bsize; 


uninitialized data in bytes, padded to 
16-byte boundary 


long 


entry; 


entry point 


long 


textjstart; 


base of text used for this file 


long 


data_$tart; 


base of data used for this file 


long 


bss_start; 


base of bss used for this file 


long 


gprmask; 


general purpose register mask 


long 


cprmask[4]; 


co-processor register masks 


long 


gp_value; 


the gp value used for this object 



The next section describes the magic field in the Optional Header. 

Optional Header Magic Field (magic) 

Table 9.5 shows the values of the magic field for the Optional Header; the header 
file aouihdrh contains the preprocessor macro definitions. 

Table 95 . UNIX Magic Numbers 



( 



Symbol Value 



Description 



OMAGIC 0407 

NMAGIC 0410 

ZMAGIC 0413 

L1BMAGIC 0443 



Impure Format. The text is not write-protected or sharable; the 
data segment is contiguous with the text segment. 

Shared Text. The data segment starts at the next page following 
the text segment and the text segment is write-protected. 

The object file is to be demand loaded and has a special format; the 
text and data segments are separated. Text segment is also write 
protected. (The MIPS default). 

The object file is a target shared library to be demand loaded and 
file has a special format like that of a ZMAGIC file. 



( 
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See the Object Files section in this chapter for information on the format of 
OMAGIC, NMAGIC, ZMAGIC, and LIBMAGIC files. 



Section Headers 



Table 9.6 shows the format of the Section Header; the header file scnhdr.h. con- 
tains the definition. 

Table 9.6 Section Header Format 



Declaration 


Field 


Description 


char 


s_name[8]; 


section name 


long 


s_paddr; 


physical address 


long 


s_vaddr; 


virtual address 


long 


sjsize; 


section size 


long 


sjscnptr; 


file pointer to raw data for section 


long 


s_relptr; 


file pointer to relocation 


long 


sjnnoptr, 


file pointer to gp (global pointer) tables 


unsigned short 


s_nreloc; 


number of relocation entries 


unsigned short 


s_nlnno; 


number of gp tables 


long 


sjlags; 


flags 



The sections that follow describe in detail some of the entries in the Section 
Header. 



Section Name (s_name) 



Table 9.7 shows the constants for section names that can appear in the sjiame 
field of the Section Header; the header file scnhdr.h contains the preprocessor 
macro definitions. 

Table 9.7 Section Header Constants for Section Names 



Declaration 


Field 


Description 


TEXT 


".text" 


text section 


INIT 


".init" 


the initialization text section for shared libraries 


RDATA 


".rdata" 


read only data section 


DATA 


".data" 


large data section 


_LIT8 


".1*8" 


the 8 byte literal pool section 


_LIT4 


".lit4" 


the 4 byte literal pool section 


SDATA 


".sdata" 


small data section 


_BSS 


".bss" 


large bss section 


SBSS 


".sbss" 


small bss section 


_LIB 


".lib" 


the shared library information section 


_UCODE 


".ucode" 


the ucode section 
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Flags (s_flags) 



Table 9.8 shows the flags that appear in sjlags; the header file schdrM contains 
their definition (those flags that are not used by compiler system object files are 
noted). 

Table 9.8 Format of sjlags Section Header Entry 



Symbol 



Value 



Description 



c 



STYP.REG 


0x00 


STYP.DSECT 


0x01 


STYP NOLOA 


0x02 


STYP.GROUP 


0x04 


STYP PA 


0x08 


STYP_COPY 


0x10 


mally 




STYP TEXT 


0x20 


STYP INIT 


0x80000000 


STYP RDATA 


0x100 


STYP_DATA 


0x40 


STYP LIT8 


0x08000000 


STYP_LIT4 


0x10000000 


STYP_SDATA 


0x200 


STYP_SBSS 


0x80 


STYP_BSS 


0x400 


STYP LIB 


0x40000000 


STYPJJCODE 


0x800 


S_NRELOC_OVFL 0x20000000 



regular section; allocated, relocated, loaded 
! dummy; not allocated, relocated, not loaded 
Inoload; allocated, relocated, not loaded 
! grouped; formed of input sections 
Ipadding; not allocated, not relocated, loaded 
!copy; for decision function used by field 
update; not allocated, not relocated, loaded; 
relocated, and line number entries processed nor- 

text only 

section initialization text only 

read only data only 

data only 

section 8 byte literals only 

section 4 byte literals only 

small data only 

contains small bss only 

bss only 

section contains shared library information only 

section contains ucode only 

sjireloc overflowed, the value is in r_vaddr of 

the first entry 



\Not used by compiler system object modules. 



The last flag in the above table, S^NRELOC^OVFL, is used when the number of 
relocation entries in a section overflows the sjireloc field of the section header. 
Then, the sjireloc field contains the value Oxffff and the sjlags field has the 
S_NRELOC_OVFL flag set; the true value is in the rjaddr field of the first re- 
location entry for that section. That relocation entry has a type of R_ABS and all 
other fields are zero, causing it to be ignored under normal circumstances. 

NOTE: For performance reasons, the link editor uses the sjlags entry instead 
of s__name to determine the type of section. However, the link editor does cor- 
rectly fill in the sjiame entry. 



c 



( 
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Global Pointer Tables 



The gp (global pointer) tables are part of the object file that is produced by the 
assembler. These are used by the link editor in calculating the best -G num to 
compile the objects are specified as recompilable by the -count option. There is 
a gp table for the small data and .bss sections only. 

The gp table gives the section size corresponding to each applicable value speci- 
fied by the -G num option (always including 0), sorted by smallest size first. 
The sjnnoptr field in the section header points to this value and the sjilnrto field 
contains the number of entries (including the header). The gp table exists only 
for the .sdata and .sbss sections sections. If there is no " small" section, the re- 
lated gp table is attached to the corresponding 'large" section to provide the link 
editor with this information. 

A design flaw exist because of literal pools. When an object does not contain a 
data and bss section, the -G num option specified for the object at compilation is 
unknown. Because the size of the literal pools cannot be known, this complicates 
the calculation of a best -G num. However, a reliable calculation can be made 
when there is an 8-byte literal pool, which ensures that the object was compiled 
with a -G of at least eight. 

The following code shows the global pointer table definition: 

union gp__table { 
struct { 

long current_g_value; /* actual value */ 

long unused; 
} header; 

struct { 

long g_value; /* hypothetical value */ 

long bytes; /* section size of hypothetical value */ 

} entry; 

}; 



Shared Library Information 



The lib section contains the shared libraries used by executable objects. The ab- 
sence of a Mb section header indicates that no shared libraries are used. Shared 
libraries are a feature of System V; thus, only objects that execute on System V 
should contain lib sections. The field sjilib in the section header is defined to 
be the same as sjpaddr and contains the number of shared library entries in the 
Mb section. The shared library information definition shown below defines a 
compiler system lib section entry. Note the size and offset are in sizeof(long)'s 
not bytes. The size (in bytes) of each entry must be a multiple of SCNROUND. 
The name the offset field refers to is a standard 'C null-terminated string. 
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Section Data 



struct libscn { 

long size; /* size of this entry (including target 
name) */ 

Ipng offset; /* offset from start of entry to target 
name*/ 

long tsize; /* text size in bytes*/ 
long dsize; /* initialized data size in bytes*/ 
long bsize; /* uninitialized data size in bytes*/ 
long text_start; /* base of text used for this li- 
brary*/ 

long data__start; /* base of data used for this library*/ 
long bss_start; /* base of bss used for this library*/ 
/* pathname of target shared library */ 
}; 



( 



RISCompilert system files are represented in the following sections: .text, .init, 
(shared library initialization tcxt).rdata (read-only data), .data (data) Mt8 
(8-byte literal pool), Mt4 (4-byte literal pool), .sdata (small data), .shss (small 
block started by storage), .bss (block started by storage) .lib (shared library infor- 
mation), and .ucode (intermediate code). The .text section contains the machine 
instructions that are to be executed; the sdata, .data, Mt8 9 Mt4, and .sdata con- 
tain initialized data; and the .sbss, and .bss sections reserve space for uninitial- 
ized data that is created by the kernel loader for the program before execution 
and filled with zeros. 

Figure 9.2. shows the layout of the sections. 



c 







i- 

r bss segment 




.text 


.init 


jdata 


.data 


Jit8 


Jit4 


.sdata 


.sbss 


.bss 







Figure 9.2. Organization of Section Data 
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As noted in Figure 9.2., the sections are grouped into the text segment (contain- 
ing the .text and init sections), the data segment (.rdata, .data, Mt8, Mt4, and 
.sdata) and the bss segment (.sbss and .bss). A section is described in and refer- 
enced through the Section Header, and segments through the Optional Header. 

The link editor references the data shown in Figure 9.2. both as sections and seg- 
ments, through the Section Header and Optional Header respectively. However, 
the system (kernel) loader, when loading the object file at run-time, references 
the same data only by segment, through the Optional Header. 



Section Relocation Information 



This portion of the chapter is divided into the following parts: 

• The format of a relocation table entry and an explanation of its 
fields. 

• The logic followed by the assembler and the link editor is creat- 
ing and updating an entry. 



Relocation Table Entry 



Table 9.9 shows the format of an entry in the Relocation Table; the header file 
reloc.h contains the definition. 

Table 9.9 Format of a Relocation Table Entry 



Declaration 


Field 


long 
unsigned 


r_vaddr; 
r_symndx:24. 




r_reserved:3, 




r_type:4, 
r_extern:l; 



Description 



(virtual) address of an item to be relocated, 
index into external symbols or section 
number; see r_extern below. 

relocation type 

= 1 for an external relocation entry; 

r_symndx is an index into External Symbols. 

= for a local, relocation entry; r_symndx 

is the number of the section containing the 

symbol. 



The sections that follow describe some of the fields shown in Table 9.9 

Symbol Index (r_symndx) and Extern Field (r_extern) 

For external relocation entries, r_extern is set to 1 and r_symnndx is the index 
into External Symbols for this entry. In this case, the value of the symbol is used 
as the value for relocation. 

For local relocation entries, r_extern is set to 0, and r_symndx contains a con- 
stant that refers to a section. In this case, the starting address of the section to 
which the constant refers is used as the value for relocation. 
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Table 9.10 gives the section numbers for r^symndx; the reloc.h file contains their 
preprocessor macro definitions, 

Table 9 JO Section Numbers for Local Relocation Entries 



( 



Symbol 


Value 


Description 


R_SN_TEXT 


1 


.text section 


R SN INIT 


7 


.init section 


R SN RDATA 


2 


.rdata section 


R_SN_DATA 


3 


.data section 


R SN SDATA 


4 


.sdata section 


R SN SBSS 


5 


.sbss section 


R_SN_BSS 


6 


.bss section 


R SN LIT8 




Jit8 section 


R_SN_LIT4 


9 


.lit4 section 



Relocation Type (rjype) 

Table 9.1 1 shows valid symbolic entries for the relocation type (rjype) field; 
the header file reloc.h contains their preprocessor macro definitions. 

Table 9.11 Relocation Types 



Symbol Value 



Description 



( 



R ABS 


0x0 


R REFHALF 


0x1 


R_REFWOR 


0x2 


R_JMPADDR 


0x3 


RJREFHI 


0x4 


R_REFLO 


0x5 


R_GPREL 


0x6 


R LITERAL 


0x7 



relocation already performed. 
16-bit reference to the symbol's virtual address 
32-bit reference to the symbol's virtual address 
26-bit jump reference to the symbol's virtual address 
reference to the high 16-bits of symbol's 
virtual address 

reference to the low 16-bits of symbol's 
virtual address 

reference to the offset from the global pointer to the 
symbol's virtual address 

reference to a literal in a literal pool as an offset from 
the global pointer 



Assembler and Link Editor Processing 



Compiler system executable object modules with all external references defined, 
have the same format as relocatable modules and are executable without re-link 
editing. 

Local relocation entries must be used for symbols that are defined. Therefore, 
external relocations are used only for undefined symbols. Figure 9.3. gives an 
overview of the Relocation Table entry for an undefined external symbol. 



( 
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Relocation Table Entry 



r vaddr 



r_symndx 



r extern=l 



External Symbols 



^J^!i!^! i| ^!iWWM^itW;! i / i> (jM i ! i M i ^ 



value=0 






Section Data 



constant 



Sign-extended to 32 bits. 



Figure 9.3. Relocation Table Entry for Undefined External Symbols 
The assembler creates this entry as follows: 

Sets r_vaddr to point to the item to be relocated. 

Places a constant to be added to the value for relocation at the 
address for the item to be relocated (r_vaddr). 

Sets rjymndx to the index of the External Symbols entry that 
contains the symbol value. 

Sets rjype to the constant for the type of relocation types. Table 
9.1 1 shows the valid constants for the relocation type. 

Sets r_extern to 1. 

NOTE: The assembler always sets the value of the undefined entry in External 
Symbols to 0. It may assign a constant value to be added to the relocated value at 
the address where the location is to be done. If the width of the constant is less 
than a full word, and an overflow occurs after relocation, the link editor flags this 
as an error. 

When the link editor determines that an external symbol is defined, it changes the 
Relocation Table entry for the symbol to a local relocation entry. Figure 9.4. 
gives an overview of the new entry. 
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Relocation Table Entry Section n 0ptional 



r vaddr 



r_symndx 



rjype 
r_extern=6 



Section Data 






constant 



OliYiViXiYiXViViVMVi^K^jjiViiiiiiiiii 
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l '" , v l ada^ ,l ' l ' 
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Section n Data 



> 



Sign-extended to 32 bits. 



WW>W$!Mf$&ty 



symbol location 






mmmmmmmmmm. 
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( 



Figure 9 A. Relocation Table Entry for a Local Relocation Entry 

To change this entry from an external relocation entry to a local relocation entry, 
the link editor: 

• Picks up the constant from the address to be relocated (r_yaddr). 

• If the width of the constant is less than 32 bits, sign-extends the 
constant to 32 bits. 

• Adds the value for relocation (the value of the symbol) to the 
constant and places it back in the address to be relocated. 

• Sets rjsymndx to the section number that contains the external 
symbol. 

• Sets r_extern to 0. 

Examples 

The examples that follow use external relocation entries. 

Example 1: 32-Bit Reference— R REFWORD. This example shows assem- 
bly statements that set the value at location b to the global data value y. 



c 



.globl y 
.data 
b: .word y #R_REFWORD relocation type at address b for symbol y 



c 
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In processing this statement, the assembler generates a relocation entry of type 
RJREFWORD for the address h and the symbol y. After determining the address 
for the symbol y, the loader adds the 32-bit address of y to the 32-bit value at 
location b, and places the sum in location b. The loader handles 16-bit addresses 
(R_REFHALF) in the same manner, except it checks for overflow after determin- 
ing the relocation value. 

Example 2: 26-Bit Jump— RJMPADDR. This example shows assembly 
statements that call routine x from location c. 





.text 


x: 


troutine x 


c: 


jal x #R_JMPADDR relocation type at address c for symbol x 



In processing these statements, the assembler generates a relocation entry of type 
RJMPADDR for the address and the symbol x. After determining the address 
for the routine, the loader shifts the address right two bits, adds the low 26 bits of 
the result to the low 26 bits of the instruction at address c (after sign-extending 
it), and places the results back into the low 26 bits at address c. 

RJMPADDR relocation entries are produced for the assembler's j (Jump) and 
jal (Jump and Link) instructions. These instructions take the high four bits of the 
target address from the address of the delay slot of their instruction. The link 
editor makes sure that the same four bits are in the target address after relocation; 
if not, it generates an error message. 

If the entry is a local relocation type, the target of the Jump instruction is assem- 
bled in the instruction at the address to be relocated. The high four bits of the 
jump target are taken from the high 4 bits of the address of the delay slot of the 
instruction to be relocated. 

Example 3: High/Low Reference-R JtEFHI/R REFLO. This example 
shows an assembler macro that loads the absolute address y, plus a constant, into 
Register 6: 

In processing this statement, the assembler generates a as the value y, and the 
following machine language statements: 



f: 


lui $at, constant»16 


#R_REFHI relocation type 
at address f for symbol y 


g: 


addiu $r6,constant&0xffff ($at) 


#R_REFLO relocation type at 
at address g for symbol y 



In this example, the assembler produces two relocation entries. 

NOTE: When a R_REFHI relocation entry appears, the next relocation entry 
must always be the corresponding R_REFLO entry. This is required in order to 
reconstruct the constant that is to be added to the value for relocation. 

In determining the final constant values for the two instructions, the link editor 
must take into account that the addiu instruction of the R_REFLO relocation en- 
try, sign-extends the immediate value of the constant. 
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In determining the sum of the address for the symbol y and the constant, the link ^ 

editor does the following: \^ 

• It uses the low 1 6 bits of this sum for the immediate value of the 
R_REFLO relocation address. 

• Because all instructions that are marked with a R_REFLO per- 
form a signed operation, the assembler adjusts the high portion 
of the sum if Bit 15 is set. Then it uses the high 16 bits of the 
sum for the immediate value of the R JREFHI instruction at the 
relocation address. For example: 



lw $r6, 0xl00080fl1f" 



lw $r6, 0x8000 ($at) 



0x10008000 



Example 4: Offset Reference — R_GPREL. This example shows an assembly 
macro that loads a global pointer relative value y into Register 6: 

lw $r6,y 

In processing this statement, the assembler generates a as the value y and the 
following machine language statement: 

h: lw $r6,0($gp) #R_GPREL relocation type at ad- 
dress h for symbol y 

and a RJ3PREL relocation entry would be produced. The assembler then uses 
the difference between the address for the symbol y and the address of the global 
pointer, as the immediate value for the instruction. The link editor gets the value 
of the global pointer used by the assembler from gp_value in the Optional Header 
(Table 9.4). 

Example 4: Example of the R LITERAL. This example shows of an RJLIT- 
ERAL uses a floating-point literal. The assembler macro: 

li.s $f0, 1.234 

is translated into the following machine instruction: 

h: Iwcl $f0, -32752 (gp) # R_LITERAL relocation type at 

address h for the literal 1.234 

and a RJLITERAL relocation entry is produced; the value of the literal is put into 
the Mt4 section. The link editor places only one of all like literal constants in the 
literal pool. The difference between the virtual address of the literal and the ad- 
dress of the global pointer is used as the immediate value for the instruction. The 
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Object Files 



link editor handles 8-byte literal constants similarly, except it places each unique 
constant in the Mt8 section. The value of the -G num option used when compil- 
ing determines if the literal pools are used. 



This section describes the object-file formats created by the link editor, namely 
the Impure (OMAGIC), Shared Text (NMAGIC), Demand Paged (ZMAGIC), 
and target-shared libraries (LIBMAGIC) formats. Before reading this section, 
you should be familiar in the format and contents of the text, data, and bss seg- 
ments as described in the Section Data section of this chapter. 

The following rules dictate certain constraints on the address at which an object 
can be loaded and the boundaries of its segments; the operating system can dic- 
tate additional constraints. 

1. OMAGIC, NMAGIC or ZMAGIC object files can execute on UMIPS-BSD 
or UMIPS-V systems. 

2. Segments must not overlap and all addresses must be less than 0x80000000. 

3. Space should be reserved for the stack, which starts below 0x80000000 and 
grows through lower addresses, that is, the value of each subsequent address 
is less than that of the previous address, 

4. The default text segment address for ZMAGIC and NMAGIC files is 
0x00400000 and the default data segment address is 0x10000000. 

5. The default text segment address for OMAGIC files is 0x10000000 with the 
data segment following the text segment. 

6. Don't specify the -B num option (specifying a bss segment origin) for 
OMAGIC files; the default, which specifies that the bss segment follow the 
data segment, must be used. 

7. UMIPS-BSD requires a 4-megabyte boundary for segments. Objects linked 
at addresses other than the default will run under the Release 2.0 and later 
UMIPS-BSD releases. 

8. UMIPS-V requires a 2-megabyte boundary for segments. OMAGIC object 
files execute under UMIPS-V Release 1.2 and later release. 
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Impure Format (OMAGIC) Files 

An OMAGIC file has the format shown in Figure 9.5. 



c 











r bss segment 
> data segment 
r text segment 




.bss 


] 

4 

1 


a 




.sbss 


.sdata 


.lit4 


.lit8 


.data 


.rdata 


.init 


.text 


ligned 


on a 16-byte boundary 



Figure 9.5. Layout of OMAGIC Files in Virtual Memory 

The OMAGIC format has the following characteristics: 

• Each section follows the other in virtual address space aligned on 
an 16-byte boundary. 

• No blocking of sections. 

• Text, data and bss segments can be placed anywhere in the vir- 
tual address space using the link editor's — T, — D and — B op- 
tions. 

• The addresses specified for the segments must be rounded to 
16-byte boundaries. 

• The text segment contains only the .text and init sections. 

• The sections in the data segment are ordered as follows: .rdata, 
.data, iit8, iit4, and .sdata 

• The sections in the bss segment are ordered as follows: .shss and 
.hss. 



c 
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Shared Text (NMAGIC) Files 

An NMAGIC file has the format shown in Figure 9.6. 



.bss 



.sbss 



.sdata 



.1H4 



.lit8 



.data 



.rdata 



.init 



.text 



} 



bss segment 



> data segment 



text segment 



-aligned on a 16-byte boundary 
aligned on a page-size boundary 



Figure 9.6. Layout of NMAGIC Files in Virtual Memory 

An NMAGIC file has the following characteristics: 

• The virtual address of the data segment is on a pagesize bound- 
ary. 

• No blocking of sections. 

• Each section follows the other in virtual address space aligned on 
an 16-byte boundary. 

• Only the start of the text and data segments, using the link edi- 
tor's — T and — D options, can be specified for a shared text for- 
mat file; the start of the text and data segments must be a multi- 
ple of the pagesize. 
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Demand Paged (ZMAGIG) Files 

A ZMAGIC file is a demand, paged file in the format shown in Figure 9.7. 

A ZMAGIC file has the following characteristics: 

• The text segment and the data segment are blocked, with 

pagesize as the blocking factor. Blocking reduces the complex- 
ity of paging in the files. 



c 



2GI 

2G-32K ] 
1 

fill area 

256M 

4 Mbyte +• header 
4 Mbyte 






1 32K (not accessible 
1 by user). 

atf .,„ bottom of st^ck 


uoase 1 
Stack Area $ 


sbrk arena 


increases automatically 
as required. 

r bss segment 

► data segment 

(blocked by pagesize) 

1 text segment 

| (blocked by pagesize) 


.bss 


.sbss 
0areaM 


.sdaia 


Jit4 


^^S^$ 


.data 


^^^^^^^^! 


empty 


•filtama' . . 


init 


^ext 


ll:|!l:Ill|ll^llllll!;lli| 


empty 





Figure 9.7, Layout of ZMAGIC Files in Virtual Memory 



c 



( 
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• The size of the sum of the of the File, Optional, and Sections 
Headers (Tables 9.1, 9.4, and 9.6) rounded to 16 bytes is in- 
cluded in blocking of the text segment. 

• The text segment starts by default at 0x400000 (4 Mbyte) , plus 
the size of the sum of the headers again rounded to 16 bytes. 
With the standard software, the text segment starts at 0x400000 
+ header size. 

NOTE: This is required because the first 32K bytes of memory 
are reserved for future use by the compiler system to allow data 
access relative to the constant register 0. 

• Only the start of the text and data segments, using the link edi- 
tor's — T and — D options can be specified for a demand paged 
format file and must be a multiple of the pagesize. 

Figure 9.8. shows a ZMAGIC file as it appears in a disk file. 











Symbol Table 


I data segment 

(blocked by pagesize) 

4 

Y text segment 

1 (blocked by pagesize) 


Fill Area 


.sdata 


Jit4 


Jit8 


.data 


.rdata 


fill area 


.text 


.init 


headers 









Figure 9.8. Layout of a ZMAGIC File on Disk. 
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Target Shared Library (LIBMAGIC) Files 



Typically, mk$hlib(l) creates target shared libraries; however, the link editor cre- 
ates such libraries when its -c option is specified (each shared library file name is 
displayed during the link if the -v option is supplied) . 

LIBMAGIC files are demand paged and have the same format as ZMAGIC file 
except as follows: 

• headers are put on their own page 

• the xt section starts on the next page from the value of the -T 
num option. This prevents the number and size of headers from 
affecting the start of the first real text. The first real text is the 
branch table and must stay at the same address. 

Both the -T and -D options should be specified, because the defaults would 
cause the target shared library to overlay the ZMAGIC files and cause an execu- 
tion failure. The link editor -c option requires that the files to be linked are com- 
piled with the -G option (which sets the link editor -G option). 



Objects Using Shared Libraries 



Ucode objects 



c 



( 



Object files that use shared libraries contain a Mb section following the data seg- 
ment (including the zero fill area created by blocking it to zpagesize). All object 
files on System V systems contain an init section used by shared library initiali- 
zation code. Shared library initialization instructions are generated by mkshlih(l) 
from the #init directive in the library specification file. This following code from 
the shared library specification 

#init bar.o 

__libfoo__ext ext 

generates these instructions generated in the .init section: 

la $2, ext 

3w $2,__libfoo_ext 

Initialization instructions are not bounded by any procedure; the initialization 
instructions from each .init section are concatenated and the runtime startup 
(crtl.o) branches to its label in its .init section. Then the execution falls through 
all the concatenated .init sections until reaching crtn.o (the last object with a .init 
section) which contains the RETURN instruction. 

Object files without shared libraries contain a small .init section that executes, 
producing no significant results. 



Ucode objects contain only a file header, the ucode section header, the ucode sec- 
tion and all of the symbolic information; A ucode section never appears in a ma- | 
chine code object file. ^ 
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Loading Object Files 

The link editor produces object files with their sections in a fixed order similar to 
UNIX system object files that existed before COFF. See Figure 9. 1 for the a de- 
scription of the sections and how they are formatted. 

The sections are grouped into segments, which are described in the Optional 
Header. In loading the object module at run-time, the system (kernel) loader 
needs only the magic number in the File Header and the Optional Header to load 
an object file for execution. 

The starting addresses and sizes of the segments for all types of object files are 
specified similarly, and they are loaded in the same manner. 

After reading in the File Header and the Optional Header, the system (kernel) 
loader must examine the file magic number to determine if the program can be 
loaded. Then, the system (kernel) loader loads the text and data segments. 

The starting offset in the file for the text segment is given by the macro 

N_TXTOFF(f,a) 

in the header file a.out.h, where/is the File Header structure and a is the option 
header structure for the object file to be loaded. The tsize field in the Optional 
Header (Table 9.4) contains the size of the text segment and textjtart contains 
the address at which it is to be loaded. 

The starting offset of the data segment follows the text segment. The dsize field 
in the Section Header (Table 9.6) contains the size of the data segment; 
data_start contains the address at which it is to be loaded. 

The system (kernel) loader must fill the bss segment with zeros. The bssjtart 
field in the Optional Header specifies the starting address; bsize specifies the 
number of bytes to be filled with zeros. In ZMAGIC files, the link editor adjusts 
bsize to account for the zero filled area it created in the data segment that is part 
of of the .sbss or .bss sections. 

If the object file itself does not load the global pointer register it must be set to 
the gp_value field in the Optional Header (Table 9.4). 

The other fields in the Optional Header are gprmask and cprmask[4], whose bits 
show the registers used in the .text and .init sections. They can be used by the 
operating system, if desired, to avoid save register relocations on context-switch. 



Archive files 



The link editor can link object files in archives created by the archiver. The ar- 
chiver and the format of the archives are based on the UNIX System V portable 
archive format. To improve performance, the format of the archives symbol ta- 
ble was changed so that it is a hash table, not a linear list. 

The archive hash table is accessed through the ranhashinit() and ranlookupO li- 
brary routines in libmld.a, which are documented in the manual page ran- 
hash(3x). The archive format definition is in the header file ar.h. 
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Link Editor Defined Symbols 



Certain symbols are reserved and their values are defined by the link editor. A 
user program can reference these symbols, but not define one, or else an error is 
generated. Table 9.12 lists the names and values of these symbols; the header 
file sym.h contains their preprocessor macro definitions. 

Table 9.12 Link Editor Defined Symbols 



c 



Symbol 


Value 


Description 


ETEXT 


"etext" 


1st location after .text 


_EDATA 


"edata" 


1st location after .sdata 
(all initialized data) 


JEN 


"end" 


1st location after .bss (all data) 


FTEXT 


" ftext" 


! 1st location of .text 


FDATA 


" fdata" 


! 1st location of .data 


FBSS 


" fbss" 


! 1st location of the .bss 


PROCEDURE TABLE 


"_j>rocedurej:able" 


runtime procedure table 


.PROCEDURE TABLE SIZE 


"jprocedure.table.size" 


runtime procedure table size 


.PROCEDURE STRING TABLE 


"_procedure_string_table" 


string table for runtime proc. 


COBOL MAIN 


"_coboLmain" 


1st cobol main symbol 


J3P 


-gP 


! the value of the global 
pointer 


! compiler system only. 







The first three symbols come from the standard UNIX system link editors and the 
rest are compiler system specific. The last symbol is used by the start up routine 
to set the value of the global pointer, as shown in the following assembly lan- 
guage statements: 



( 



globl _GP 

la $gp, GP 



The assembler generates the following machine instructions for these statements: 



a: lui gp,0 # RJREFHI relocation type at address a for symbol _GP 
b: add gp,0 # R_REFLO relocation type at address b for symbol _GP 



which would cause the correct value of the global pointer to be loaded. 

The link editor symbol _COBOL_MAIN is set to the symbol value of the first 
external symbol with the cobol jnain bit set. COBOL objects uses this symbol to 
determine the the main routine. 



( 
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Runtime Procedure Table Symbols 

The three link editor defined symbols, _PROCEDURE_TABLE, __PROCE- 
DURE_TABLE_SIZE and PROCEDURE JSTRINGJTABLE, relate to the run- 
time procedure table. The Runtime Procedure Table is used by the exception 
systems in ADA, PL/I and COBOL. Its description is found in the header file 
<sym.h>. The table is a subset of the Procedure Descriptor Table portion of the 
Symbol Table with one additional field, exception Jnfo. 

When the procedure table entry is for an external procedure, and an External 
Symbol Table exists, the link editor fills in exception Jnfo with the address of the 
external table. Otherwise, its fill in exception Jnfo with zeros. 

The name of the External Symbol Table is the procedure name concatenated with 
the string _exceptionJnfo (actually, the preprocessor macro EXCEPTION_SUF- 
FIX as defined in the header file <exception.h>). 

The Runtime Procedure Table provides enough information to allow a program 
to unwind its stack. It is typically used by the routines in libexc.a. The comments 
in the header file <exception.h> describes the routines in that library. 

The Runtime Procedure Table is sorted by procedure address and always has a 
dummy entry with a zero address and a Oxffffffff address. When required, the 
table is padded with an extra zero entry to ensure that the total number of entries 
is an uneven (odd) number. 

The Runtime Procedure Table and String Table for the runtime procedure table 
are placed at then end of the .data section in the object file. 
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Symbol Table 



This chapter describes the symbol table and symbol table routines used to create 
and make entries in the table. The chapter contains the following major sections: 

• Overview, which gives the purpose of the Symbol table, a sum- 
mary of its components, and their relationship to each other. 

• Format of Symbol Table Entries, which shows the structures of 
Symbol table entries and the values you assign them through the 
Symbol Table routines. 

• Symbol Table Routine Reference, which lists the symbol table 
routines supplied with the compiler and summarizes the function 
of each. 

NOTE: Third Eye Software, Inc. owns the copyright (dated 1984) to the format 
and nomenclature of the Symbol Table used by the compiler system as docu- 
mented in this chapter. 

Third Eye Software, Inc. grants reproduction and use rights to all parties, PRO- 
VIDED that this comment is maintained in the copy. 

Third Eye makes no claims about the applicability of this symbol table to a par- 
ticular use. 
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Overview 



The symbol table in created by the compiler front-end as a stand-alone file. The 
purpose of the table is to provide information to the link editor and the debugger 
in performing their respective functions. At the option of the user, the link editor 
includes information from the Symbol table in the final object file for use by the 
debugger. See Figure 9. 1 in Chapter 9 for details. 



c 
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c 



Figure 10 J The Symbol Table - Overview 

The elements that make up the Symbol table are shown in Figure 10.1 . The 
front-end creates one group of tables (the shaded areas in Figure 10.1 ) that con- 
tain global information relative to the entire compilation, It also creates a unique 
group of tables (the unshaded areas in the figure) for the source file and each of 
its include files. 



( 
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Symbol Table 

Compiler front-ends, the assembler, and the link editor interact with the symbol 
table as summarized below: 

• The front-end, using calls to routines supplied with the compiler 
system, enters symbols and their descriptions in the table. 

• The assembler fills in line numbers, optimization symbols, up- 
dates Local Symbols and External Symbols, and updates the Pro- 
cedure Descriptor table. 

• The link editor eliminates duplicate information in the External 
Symbols and the External Strings tables, removes tables with 
duplicate information, updates Local Symbols with relocation 
information, and creates the Relative File Descriptor table. 

The major elements of the table are summarized in the paragraphs that follow. 
Some of these elements are explored in more detail later in the chapter. 

Symbolic Header. The Symbolic Header (HDRR for HeadDeR Record) con- 
tains the sizes and locations (as an offset from the beginning of the file) of the 
subtables that make up the Symbol Table. Figure 10.2 shows the symbolic rela- 
tionship of the header to the other tables. 



Symbolic Header 





Line Numbers 




Dense Numbers 




Procedure 
Descriptor Table 




Local Symbols 




Optimization Symbols 


Auxliary Symbols 




Local Strings 


External Strings 




File Descriptor Table 



Figure 102 Functional Overview of the Symbolic Header 

Line Numbers. The assembler creates the Line Number table. It creates an en- 
try for every instruction. Internally, the information is stored in an encoded form. 
The debugger uses the entries to map instruction to the source lines and vice 
versa. 

Dense Numbers. The Dense Number table is an array of pairs. An index into 
this table is called a dense number. Each pair consists of a file table index (ifd) 
and an index (isyrn) into Local Symbols. The table facilitates symbol look-up for 
the assembler, optimizer, and code generator by allowing direct table access 
rather than hashing. 
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Procedure Descriptor Table. The Procedure Descriptor table contains register 
and frame information, and offsets into other tables that provide detailed infor- 
mation on the procedure. The front-end creates the table and links it to the Local 
Symbols table. The assembler enters information on registers and frames. The 
debugger uses the entries in determining the line numbers for procedures and 
frame information for stack traces. 

Local Symbols, The Local Symbols table contains descriptions of program vari- 
ables, types, and structures, which the debugger uses to locate and interpret run- 
time values. The table gives the symbol type, storage class, and offsets into other 
tables that further define the symbol. 

A unique Local Symbols table exists for every source and include file; the com- 
piler locates the table through an offset from the file descriptor entry that exists 
for every file. The entries in Local Symbols can reference related information in 
the Local Strings and Auxiliary Symbols subtables. This relationship is shown in 
Figure 10.3 . 



c 



^FileD^H^^^|^3 



Entry for File 



Entry for File 1 



Entry for File 2 



Entry for File n 



- t ■^^^Y^^'^iMi& l ;,m^;^^ 



Local Swnbols 



Jpj>caLStrings I 
Maries 



Local Strings I 



"T^^?t rings I 



( 



Figure 10.3 Logical Relationship between the File Descriptor Table and Local Symbols 

Optimization Symbols. To be defined at a future date. 

Auxiliary Symbols. The Auxiliary Symbols tables contain data type information 
specific to one language. Each entry is linked to an entry in Local Symbols. The 
entry in Local Symbols can have multiple, contiguous entries. The format of an 
auxiliary entry depends on the symbol type and storage class. Table entries are 
required only when the compiler debugging option is ON. 

Local Strings. The Local Strings subtables contain the names of local symbols. 



( 
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External Strings. The External Strings table contains the names of external 
symbols. 

File Descriptor. The File Descriptor table contains one entry each for each 
source file and each of its include files. (The structure of an entry is given in Ta- 
ble 10.12 later in this chapter.) The entry is composed of pointers to a group of 
subtables related to the file. The physical layout of the subtables is shown in 
Figure 10.4 . 



r 



File Descriptor Table 



foe Descriptor Entry 



Line Numbers 



Procedure 
DpffirriptorT^hlfi 



Local Symbols 



Optimization Symbols 



Auxliary Symbols 



Local Strings 



Relative File Descriptor 



Figure 10 A Physical Relationship of a File Descriptor Entry to Other Tables 

The file descriptor entry allows the compiler to access a group of subtables 
unique to one file. The logical relationship between entries in this table and in its 
subtables is shown in 10.5. 
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Flic Descriptor Table 



Entry for FiteO 



Entry for File 1 



Entry for File 2 





— 


Line Numbers 


Procedure 


Local Symbols 


Optimisation Symbols 




Auxliary Symbols 




Local Strings 




Relative File Descriptor 






Line Numbers 




Procedure 
DfiSfinjpiff* 1 * Tin hip 




Local Symbols 




Optimization Symbols 




Auxliary Symbols 




Local Strings 




Relative File Descriptor 



c 



c 



Figure 10.5 Logical Relationship between the File Descriptor Table and Other Tables 

Relative File Descriptor, See the section Link Editor Processing later in this 
chapter. 

External Symbols, The External Symbols contains global symbols entered by 
the front-end. The symbols are defined in one module and referenced in one or 
more other modules. The assembler updates the entries, and the link editor 
merges the symbols and resolves their addresses. 
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Format of Symbol Table Entries 
Symbolic Header 



The structure of the Symbolic Header is shown below in Table 10.1 ; the sym.h 
header file contains the header declaration,, 

Table 10.1 Format of the Symbolic Header 



Declaration 



Name 



Description 



short 


magic 


short 


vstamp 


long 


ilineMax 


long 


cbLine 


long 


cbLineOffset 


long 


idnMax 


long 


cbDnOffset 


long 


ipdMax 


long 


cbPdOffset 


long 


isymMax 


long 


cbSymOffset 


long 


ioptMax 


long 


ebOptOffset 


long 


iauxMax 


long 


cbAuxOffset 


long 


issMax 


long 


cbSsOffset 


long 


issExtMax 


long 


cbSsExtOffset 


long 


ifdMax 


long 


cbFdOffset 


long 


crfd 


long 


cbRfdOffset 


long 


iextMax 


long 


cbExtOffset 



to verify validity of the table 

version stamp 

number of line number entries 

number of bytes for line number entries 

index to start of line numbers 

max index into dense numbers 

index to start dense numbers 

number of procedures 

index to procedure descriptors 

number of local symbols 

index to start of local symbols 

maximum index into optimization entries 

index to start of optimization entries 

number of auxiliary symbols 

index to the start of auxiliary symbols 

max index into local strings 

index to start of local strings 

max index into external strings 

index to the start of external strings 

number of file descriptors 

index to file descriptor 

number of relative file descriptors 

index to relative file descriptors 

maximum index into external symbols 

index to the start of external symbols 



The lower byte of the vstamp field contains LSJSTAMP and the upper byte 
MSJJTAMP (see the stamp.h header file). These values are defined in the 
stamp.h file. The iMax fields and the cbOffset field must be set to if one of the 
tables shown in Table 10.1 isn't present. The magic field must contain the con- 
stant magicSym, also defined in longsymconst.h. 
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Line Numbers 



Table 10.2 shows the format of an entry in the Line Numbers table; the sym.h 
header file contains its declaration. 

Table 102 Format of a Line Number Entry 



a 



Declaration 



Name 



typedeflong 



LINER, *pLINER 



The line number section in the Symbol table is rounded to the nearest four-byte 
boundary. 

Line numbers map executable instructions to source lines; one line number is 
stored for each instruction associated with a source line. It is stored as a long in- 
teger in memory and in packed format on disk. 

The layout on disk is as follows: 



Bit 8 




Delta 



Count 



The compiler assigas a line number to only those lines of source code that gener- 
ate one or more executable instructions. 

Delta is a four-bit value in the range — 7...7, defining the number of source lines 
between the current source line, and the previous line generating executable in- 
structions. The Delta of the first line number entry is the displacement from the 
InLow field in the Procedure Descriptor Table. 

Count is a four-bit field with a value in the range 0...15 indicating the number 
(1...16) of executable instructions associated with a source line. If more than 16 
instructions (15+1) are associated with a source line, new line number entries are 
generated with Delta = 0. 

An extended format of the line number entry is used when Delta is outside the 
range of — 7...7. 



c 



( 
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The layout of the extended field on disk is as follows: 




Upper eight bits of Delta 



Bit & 



Lower eight bits of Delta 



J) 



NOTE: The compiler allows a maximum of 32,767 comment lines, blank lines, 
continuation lines and other lines not producing executable instructions, between 
two source lines that do. 

Line number example. This section gives an example of how the compiler 
assigns line numbers. For the source listing shown below, the compiler generates 
line numbers only for the highlighted lines (6, 7, 17, 18, and 19); the other lines 
are either blank or contain comments. 



llllllllll 



:-:•. • : ft ft: ■: : ?ft ft •;■: : : : • -:•. . : :': : :*: .-.v. : : :ft: : : : : : •> v.:-.-;-..-.;-.;-; cftxftx : m ft ft:ft: 
:*: : :*: : : : xftx *?<■.■/.<■>; : : : : : x : x rftXftXftXftXft:: m># vftv>ft SWA? ":'• ft* ■>*&>: 



II 



•XrX.x*x*x*XtX*XiXiXvfXty*XiXiX*XifrX*XiX^ 



6 printf ("this program just prints its inputW); 

7 while ((c = getc(stdin)) != EOF) { 



i u » u 1 1 1 i .u » > u 1 1 1 i i in 1 1 i.i.i i i j i 1 1 1 ij.i 1 1 i.i.i i .n.1. 1 1 1 1 M.'A'.W.'.W. WA^WA WAMAU.WAWMTOCTJJJ 'J X 'i 'M'W 

^IIBHiiMllBll^^llia 

H ** i 

17 
18 
19 



} /* end while */ 
} I* end main */ 



printf("%c",c); 



Figure 10.6 Source Listing for Line Number Example 
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Figure 10.8 (on the next page) shows the instructions generated for lines 3, 7, 17, 
18, and 19. Figure 10.7 (below) shows the compiler-generated liner entries for 
each source line. 



c 



Source 
Line 


Liner 


Contents 


Meaning 


3 

6 

7 

7 

17 1 

18 

18 2 

19 


02 

31 

If 

03 

82000a 

If 

03 

15 


delta 0, count 2 
delta 3, count 1 
delta 1, count 15 
delta 0, count 3 
-8, count 2, delta 10 
delta 1, count 15 
delta0,£ount3 
delta 1, count 5 


Extended format (count is greater than 
seven lines). 

2 Continuation. 



Figure 10 J Source Listing for Line Number Example 
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main:3, 0x4001a0] 
main:3, 0x4001a4 
main:3, 0x4001a8] 
main:6, 0x4001ac 
main:6, 0x4001b6 



addiu 

sw 

sw 

jal 
addiu 



sp,sp,-32 
r31,20(sp) 
rl6,16(sp) 
printf 

r4 -Pr327_52 




main:7, 
main:7, 
main:7, 
main:7, 
main:7, 
main:?, 
jnain:7, 
main:7, 
main:7, 
inain:7, 
main:7, 
jmain:7, 
main:7, 
main:7, 
;nain:7, 
main:7, 
main:7, 
main:7, 
[main:7, 
1 



Cx"4(X)lb4 

0x4001b8 

0x4001bc 

0x4001c0 

0x4001c4] 

0x4001c8 

0x4001cc 

0x4001d0] 

0x4001d4] 

0x4001d8] 

0x4001dc] 

0x4001e0] 

0x4001e4] 

0x4001e8] 

0x4001ec] 

0x4001f0] 

0x4001f4] 

0x4001f8] 

0x4001fc] 



ImaiirZJ]x400?,00] 



Tw 
nop 
addiu 
bltz 
sw 
lw 
nop 
lbu 
addiu 
sb 
b 

sw 
jal 

addiu 
move 
sb 
lbu 
li 

beq 
""P 



rl5,rl4,-l 
rl5,0x4001e4 
rl5-32552(gp) 
r24,-32548(gp) 

i25,0(i24) 

r8,r24,l 

i2531(sp) 

0x4001f4 

r8,-32548(gp) 

_fflbuf 

r4,gp,-32552 

rl6j2 

rl6,31(sp) 

r931(sp) 

rl-1 

r9,rl, 0x400260 




[main: 17, 0x400204 

[main: 17, 0x400208; 

main: 17, 0s 



lbu 

jal 
addiu 



r5,31(sp) 
printf 

r4,gp-32716 
rlO,-32552{gp) 




main: 
main: 
main: 
main: 
main: 
main: 
main: 
main: 
main: 
main: 
main: 
main: 
main: 
main: 
main: 
main: 
main: 
main: 
main: 
jnain 



wszmm 

18,0x400214 
18, 0x400218 
18, 0x40021c] 
18,0x400220] 
18, 0x400224 
18,0x400228 
18, 0x40022c 
18, 0x400230 
18,0x400234] 
18,0x400238 
18, 0x40023c 
18, 0x400240 
18, 0x400244 
18,0x400248] 
18,0x40024c] 
18,0x400250] 
18,0x400254] 
18,0x400258] 
jg, 0x400?,5r,] 



Tw" 
nop 
addiu 
bltz 
sw 
lw 
nop 
lbu 
addiu 
sb 
b 
sw 

jal 

addiu 

move 

sb 

lbu 

li 

bne 

nop 



rll,rl0,-l 
rl 1,0x400240 
rll,-32552(gp) 
rl2,-32548(gp) 

rl3,0(rl2) 

rl4,rl2,l 

rl3,31(sp) 

0x400250 

rl4,-32548(gp) 

_fflbuf 

r4,gp,-32552 

rl6j2 

rl6,31(sp) 

rl5,31(sp) 

rl,-l 

rl5/l ,0x400204 




mam: 
[main: 
[main: 
[main: 
[main: 
[main: 



19,0x400260] 
19,0x400264] 
19,0x400268] 
19,0x40026c] 
19,0x400270] 
19,0x400274] 



b 
nop 
lw 
lw 

jr 
addiu 



0x400268 

r31,20(sp) 
rl6,16(sp) 
r31 
sp,sp,32 




Figure 10.8 Source Listing for Line Number Example 
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Procedure Descriptor Table 



Table 10.3 shows the format of an entry in the Procedure Descriptor table; the 
sym.h header file contains its declaration. 



( 



Table 10 J Format of a Procedure Descriptor Table Entry 



Declaration 


Name 


Description 


unsigned, long 


adr 


memory address of start of procedure 


long 


isym 


start of local symbols 


long 


iline 


procedure's line numbers 


long 


regmask 


saved register mask 


long 


regoffset 


saved register offset 


long 


iopt 


procedure's optimization symbol entries 


long 


fregmask 


save floating point register mask 


long 


fregoffset 


save floating point register offset 


long 


frameoffset 


frame size 


long 


frameieg 


frame pointer register 


long 


pcreg 


index or reg of return program 
counter 


long 


InLow 


lowest line in the procedure 


long 


InHigh 


highest line in the procedure 


long 


cbLineOffset 


byte offset for this procedure from 
the base of the file descriptor entry. 



Local Symbols 



C 



Table 10.4 shows the format of an entry in the Local Symbols table; the sym.h 
header file contains its declaration. 

Table 10 A Format of a Local Symbols Entry. 



Declaration 


Name 


Description 


long 


iss 


index into local strings of symbol name 


long 


value 


value of symbol. See Table 10.5. 


unsigned 


st:6 


symbol type. See Table 10.6. 


unsigned 


sc: 5 


storage class. See Table 10.7. 


unsigned 


reserved : 1 




unsigned 


index : 20 


index into local or auxiliary symbols 
See Table 3,5, 



The meanings of the fields in a local symbol entry are explained in the following 
paragraphs. 



( 
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iss. The iss (for index into string space) is an offset from the issBase field of an 
entry in the file descriptor table, to the name of the symbol. 

value. An integer representing an address, size, offset from a frame pointer. The 
value is determined by the symbol type, as illustrated in Table 10.5. 

standsc. The symbol type (st) defines the symbol; the storage class (sc), where 
applicable explains how to access the symbol type in memory. The valid st and 
sc constants are given in Tables 10.6 and 10.7. These constants are defined in 
symconsth. 

index. The index is an offset into either Local Symbols or Auxiliary Symbols, 
depending of the storage type (st) as shown in Table 10.5 The compiler uses 
isymBose in the file descriptor entry as the base for a Local Symbol entry and 
iauxBose for an Auxiliary Symbols entry. 

Table 10 J Index and Value as a Function of Symbol Type and Storage Class 



Symbol Type 


Storage Class 


Index 


Value 


stFile 


scText 


isymMac 


address 


stLabel 


scText 


indexNil 


address 


stGlobal 


scD/B 1 


iaux 


address 


stStatic 


scD/B 1 


iaux 


address 


stParam 


scAbs 


iaux 


frame offset - 




scRegister 


iaux 


register number 




scVar 


iaux 


frame offset 2 




scVarRegister 


iaux 


register number 


stLocal 


scAbs 


iaux 


frame offset 2 




scRegister 


iaux 


register number 


stProc 


scText 


iaux 


address 




scNil 


iaux 


address 




scUndefined 


iaux 


address 


stStaticProc 


scText 


iaux 


address 


stMember 








enumeration 


sclnfo 


indexNil 


ordinal 


structure 


sclnfo 


iaux 


bit offset 


union 


sclnfo 


iaux 


bit offset 


1 scD/B is the storage class determined by the assembler, e 


ither 


largo/small or data/bss. 




2 frame offset is the offset from the virtual frame pointer. 




3 bit offset is computed from the beginning of the procedur 


e. 
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Table 10.5 Index and Value as a Function of Symbol Type and Storage Class (continued) 



Symbol Type 


Storage Class 


Index 


Value 


stBlock 




i 




enumeration 


sclnfo 


isymMac 


max enumeration 


structure 


sclnfo 


isymMac 


size 


textbook 


scText 


isymMac 


relative address 2 


common block 


scCommon 


isymMac 


size 


variant 


scVariant 


isymMac 


isymTag 3 


variant arm 


sclnfo 


isymMac 


iauxRanges 


union 


sclnfo 


isymMac 


size 


stEnd 








enumeration 


sclnfo 


isymStart 5 


2 

relative address 


file 


scText 


isymStart 


procedure 


scText 


isymStart 


relative address 2 


structure 


sclnfo 


isymStart 





text block 


scText 


isymStart 


relative address 


union 


sclnfo 


isymStart 





common block 


scCommon 


isymStart 





variant 


scVariant 


isymStart 





variant arm 


sclnfo 


isymStart 





stTypedef 


sclnfo 


iaux 





IsymMac is the isy 


m of the corresponding stEnd symbol plus 1. 


^relative address is f 
procedure. 


be relative displacement from the beginning of the 


3 

isymTab is the isyr 


n to the symbol that is the tag for the 


variant 


iauxRanges is the iaux to ranges for the variant arm. 




5 isymStartistheisy 


m of the correspodning begin block 


(stBlock, stFile, 


stProc, etc.) 







( 



( 



The link editor ignores all symbols except the types stProc, st Static, stLabel, 
stStaticProc, which it will relocate. Other symbols are used only by the debug- 
ger, and need be entered in the table only when the compiler debugger option is 

ON. 

Symbol Type (st). Table 10.6 gives the allowable constants that can be speci- 
fied in the st field of Local Symbols entries; the symconst.h header file contains 
the declaration for the constants. 



( 
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Table 10.6 Symbol Type (st) Constants Supported by the Compiler 



Constant 



Value Description 



stNil 





Dummy entry 


stGlobal 


1 


external symbol 


stStatic 


2 


static 


stParam 


3 


procedure aigument 


stLocal 


4 


local variable 


stLabel 


5 


label 


stProc 


6 


Procedure 


stBlock 


7 


start of block 


stEnd 


8 


end block, file, or procedures 


stMember 


9 


member of structure, union, or enumeration, 


stTypedef 


10 


type definition 


stFile 


11 


filename 


stStaticProc 


14 


load time only static procs 


stConstant 


15 


const 



Storage Class (st) Constants- Table 10.7 gives the allowable constants that 
can be specified in the sc field of Local Symbols entries; the symconsth header 
file contains the declaration for the constantSe 
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Table 107 Storage Class Constants Supported by the Compiler 



Constant 


Value 


Description 


scNil 





dummy entry. 


scText 


1 


text symbol 


scData 


2 


initialized data symbol 


scBss 


3 


uninitialized data symbol 


scRegister 


4 


value of symbol is register number 


scAbs 


5 


symbol value is absolute; not to be relocated. 


scUndefined 


6 


Used but undefined in the current module. 


reserved 


7 




scBits 


8 


this is a bit field 


scDbx 


9 


dbx internal use 


scReglmage 


10 


register value saved on stack 


sclnfo 


11 


symbol contains debugger information 


scUserStruct 


12 


address in struct user for current process 


scSData 


13 


(load time only) small data 


scSBss 


14 


(load time only) small common 


scRData 


15 


(load time only) read only data 


scVar 


16 


Var parameter (Fortran or Pascal) 


scCommon 


17 


common variable 


scSCommon 


18 


small common 


scVarRegister 


19 


var parameter in a register 


scVariant 


20 


variant records 


scUndefined 


21 


small undefined 


sclnit 


22 


init section symbol 



Optimization Symbols 



C 



( 



Reserved for future use. 
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Auxiliary Symbols 



Table 10.8 shows the format of an entry, which is a union, in Auxiliary Sym- 
bols; the sym.h file contains its declaration. 

Table 10.8 Storage Class Constants Supported by the Compiler 



Declaration 


Name 


Description 


TIR 


ti 


type information record 


RNDXR 


rndx 


relative index into local symbols 


long 


dnLow 


low dimension 


long 


dnHigh 


high dimension 


long 


isym 


index into local symbols for stEnd 


long 


iss 


index into local strings (not used) 


long 


width 


width of a structure field not declared 
with the default value for size. 


long 


count 


count of ranges for variant arm 



All of the fields except the ti field are explained in the order they appear in the 
above layout. The ti field is explained last. 

rndx. Relative File Index. The front-end fills this field in describing structures, 
enumerations, and other complex types. The relative file index is a pair of in- 
dexes. One index is an offset from the start of the File Descriptor table to one of 
its entries. The second is an offset from the file descriptor entry to an entry in the 
Local Symbols or Auxiliary Symbols table. 

dnLow. Low Dimension of Array. 

dnHigh. High Dimension of Array. 

isym. Index into Local Symbols. This index is always an offset to an stEnd en- 
try denoting the end of a procedure. 

width. Width of Structured Fields. 

count. Range Count. Used in describing case variants. Gives how many ele- 
ments are separated by commas in a case variant. 

ti. Type Information Record. Table 10.9 shows the format of a ti entry; the 
sym.h file contains its declaration. 
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Table 103 Format of an Type Information Record Entry 



c 



Declaration 



Name 



unsigned 


fflitfield:! 


unsigned 


continued : 1 


unsigned 


bt :6 


unsigned 


tq4 


4 


unsigned 


tq5 


4 


unsigned 


tqO 


4 


unsigned 


tql 


4 


unsigned 


tq2 


4 


unsigned 


tq3 


4 



Description 



setif bit width is specified 
next auxiliary entry has tq info 
basic type 
Type qualifier. 



All groups of auxiliary entries have a type inforaiation record with the following 
entries: 

• fbitfield. Set if the basic type (bt) is of non-strandard width. 

• bt (for basic type) specifies if the symbol is integer, real com- 
plex, numbers , a structure, etc. The valid entries for this field are 
shown in Table 10. 10; the sym.h file contains its declaration. 

• tq (for type qualifier) defines whether the basic type (bt) has an 
array of function returning, or pointer to qualifier. The valid 
entries for this field are shown in Table 10.1 1 ; the sym.h file 
contains its declaration. 



c 



( 
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Table 10.10 Basic Type (bt) Constants 







Default 




Constant 


Value 


Size* 


Description 


blNil 








undefined, void 


btAdr 


1 


32 


address - same size as pointer 


btChar 


2 


8 


symbol character 


btUChar 


3 


8 


unsigned character 


btShort 


4 


16 


short (16 bits) 


btUShort 


5 


16 


unsigned short 


btlnt 


6 


32 


integer 


btUInt 


7 


32 


unsigned integer 


biLong 


8 


32 


long (32 bits) 


btULong 


9 


32 


unsigned long 


btFloat 


10 


32 


floating point (real) 


btDouble 


11 


64 


double-precision floating point real 


btS tract 


12 


n/a 


structure (Record) 


btUnion 


13 


n/a 


union (variant) 


btEnum 


14 


32 


enumerated 


btTypedef 


15 


n/a 


defined via a typedef; rndx points at 
a stTypedef symbol. 


btRange 


16 


32 


subrange of integer 


btSet 


17 


32 


pascal sets 


btComplex 


18 


64 


fortran complex 


btDComplex 


19 


128 


fortran double complex 


btlndirect 


20 




Indirect definition;rndx points to TTR aux 


btMax 


64 






♦Size in bits, 








Table 10.11 Type Qualifier (tq) Constants 


Constant 


Value 


Description 


tqNil 







Placeholder. No qualifier. 


tqptr 


1 




pointer to 


tqProc 


2 




function returning 


tqArray 


3 




array of 


tqVol 


5 




volatile 


tqMax 


8 
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File Descriptor Table 



Table 10. 12 shows the format of an entry in the File Descriptor table; the sym.h 
file contains its declaration. 

Table 10.12 Format of File Descriptor Entry 



c 



Declaration 



Name 



Description 



unsignecUong 


adr 


long 


rss 


long 


issBase 


long 


cbSs 


long 


isymBase 


long 


csym 


long 


ilineBase 


long 


cline 


long 


ioptBase 


long 


copt 


short 


ipdFirst 


short 


cpd 


long 


iauxBase 


long 


caux 


long 


rfdBase 


long 


erfd 


unsigned 


lang:5 


unsigned 


fMerge : 1 


unsigned 


fReadin : 1 


unsigned 


ffiigendian : 1 


unsigned 


reserved: 22 


long 


cbLineOffset 


long 


cbLine 



memory address of start of file 

source file name 

start of local strings 

number of bytes in local strings 

start of local symbol entries 

count of local symbol entries 

start of line number entries 

count of line number entries 

start of optimization symbol entries 

count of optimization symbol entries 

start of procedure descriptor table 

count of procedures descriptors 

start of auxiliary symbol entries 

count of auxiliary symbol entries 

index into relative file descriptors 

relative file descriptor count 

language for this file 

whether this file can be merged 

true if it was read in (not just 

created) 

if set, was compiled on big endian 

machine aux's is in compile host's sex 

reserved for future use 

byte offset from header or file In's 



c 



External Symbols 



The External Symbols table has the same format as Local Symbols, except an 
offset (ifd) field into the File Descriptor table has been added. This field is used 
to locate information associated with the symbol in an Auxiliary Symbols table. 
Table 10.13 shows the format of an entry in External Symbols; the sym.h file 
contains its declaration. 



( 
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Symbol Table 

Table 10.13 Format an Entry in External Symbols 

Declaration Name Description 

reserved for future use 
pointer to file descriptor entry 
Same as Local Symbols 



short 


reserved 


short 


ifd 


SYMR 


asym 
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A 
Instruction Summaries 



ms^i$mmg%m®$mmm 



The tables in this chapter summarize the assembly language instruction set. Most 
of the assembly language instructions have direct machine equivalents. Refer to 
Appendix A and Appendix B of the RISC Architecture book for detailed instruc- 
tion descriptions. In the tables in this appendix, the operand terms have the fol- 
lowing meanings: 



Operand 


Description 


destination 


destination register 


address 


expression 


source 


source register 


expression 


aboslute value 


immediate 


immediate value 


label 


symbol label 


breakcode 


value that determines the break 



Table A.l Main Processor Instruction Summary 



Description 


Op-code 


Operand 


Load Address 


la 


destination,address 


Load Byte 

Load Byte Unsigned 

Load Halfword 


lb 

lbu 

Ih 




Load Halfword Unsigned 
Load Word 


Ihu 
Iw 




Load Coprocessor z 
Load Word Left 


Iwcz 
Iwl 




Load Word Right 


Iwr 




Store Byte 
Store Halfword 


sb 
sh 


source,address 


Store Word 


sw 




Store Word Coprocessor z 
Store Word Left 


swcz 
swl 




Store Word Right 
Unaligned Load Halfword 
Unaligned Load Halfword Unsigned 
Unaligned Load Word 


swr 
ulh 
ulhu 
ulw 




Unaligned Store Halfword 
Unaligned Store Word 

; „ 


ush 
usw 
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Table A J Main Processor Instruction Summary (continued) 






Description 


Op-code 


Operand 






Load Immediate 


li 


destination,expression 






Load Upper Immediate 


lui 








Restore From Exception 


rfe 








Syscall 


syscall 








Absolute Value 


abs 


destination,srcl 






Negate (with overflow) 


neg 


destination/srcl 






Negate (without overflow) 


negu 








NOT 


not 








Add (with overflow) 


add 


destinations 1 ,src2 






Add (without overflow) 


addu 


destination/srcl ,src2 






AND 


and 


destination,srcl immediate 






Divide (with overflow) 


div 


destination/srcl , immediate 






Divide (without overflow) 


divu 








EXCLUSIVE OR 


xor 








Multiply 


mul 








Multiply (with overflow) 


mulo 








Multiply (with overflow) 


mulou 








Unsigned 










NOT OR 


nor 








OR 


or 








Remainder 


rem 








Remainder Unsigned 


remu 








Rotate Left 


rol 








Rotate Right 


ror 








Set Equal 


seq 








SEt Less Than 


sit 








Set Less Than Unsigned 


situ 








Set Less/Equal 


sle 








Set Less/Equal Unsigned 


sleu 








Set Greater Than 


sgt 








Set Greater Than Unsigned 


sgut 








Set Greater/Equal 


sge 








Set Greater/Equal Unsigned 


sgeu 








Set Not Equal 


sne 








Shift Left Logical 


sll 








Shift Right Arithmetic 


sra 








Shift Right Logical 


srl 








Subtract (with overflow) 


sub 








Subtract (without overflow) 


subu 








Multiply 


mult 


srcl,src2 






Multiply Unsigned 


multu 







( 
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Table A.L Main Processor Instruction Summary (continued) 



Description 


Op-code 


Operand 


Branch 

Branch Coprocessor z True 

Branch Coprocessor z False 


b 

bczt 

bczf 


label 


Branch on Equal 

Branch on Greater 

Branch on Greater/Equal 

Branch on Greater/Equal Unsigned 

Branch on Greater Than Unsigned 

Branch on Less 

Branch on Less/Equal 

Branch on Less/Equal Unsigned 

Branch on Less Than Unsigned 

Branch on Not Equal 


beq 

bgt 

bge 

bgeu 

bgtu 

bit 

ble 

bleu 

bltu 

bne 


srcl,src2,label 
srcl ,immediate,label 


Branch and Link 


bal 


label 


Branch on Equal Zero 
Branch on Greater/Equal Zero 
Branch on Greater or Equal to zero 
and Link 

Branch on Greater Than Zero 
Branch on Less/Equal Zero 
Branch on Less Than Zero 
Branch on Less Than Zero and Link 
Branch on Not Equal Zero 


beqz 
bgez 
bgeza! 

bgtz 

blez 

bltz 

bltzal 

bnqz 


srcl, label 


Jump 

Jump and Link 


J 
jal 


address 
srcl 


Break 


break 


breakcode 


Coprocessor z Operation 


cz 


expression 


Move 


move 


destinations 1 


Move From HI Register 
Move To HI Register 
Move From LO Register 
Move To LO Register 


mfhi 
mthi 
mflo 
mtlo 


register 


Move From Coprocessor z 
Move To Coprocessor z 


mfcz 
mtcz 


dest-gpr, source 
src-gpr, destination 


Control From Coprocessor z 
Control to Coprocessor z 


cfcz 
ctcz 


src-gpr, destination 
dest-gpr, source 
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Table A 2 System Coprocessor Instruction Summary 



Description 


Op-code 


Operand 


Translation Lookaside Buffer Probe 
Translation Lookaside Buffer Read 
Translation Lookaside Buffer Write Random 
Translation Lookaside Write Index 


tlbp 
tlbr 
tlbwr 
tlbwi 





c 



Table A3 Floating Point Instruction Summary 



Description 


Op-code 


Operand 


Load Fp 

Double 
Single 


Id 
Ls 


destination,offset(base) 


Store FP 

Double 
Single 


s.d 

s.s 


source,offset(base) 


Absolute Value Fp 

Double 
Single 


abs.d 
abs.s 


destination,srcl 


AddFp 

Double 

Single 
Divide Fp 

Double 

Single 
Multiply 

Double 

Single 
Subtract Fp 

Double 

Single 


add.d 
add.s 

div.d 
div.s 

muLd 
mul.s 

sub.d 
sub.s 


destination,src 1 ,src2 



c 
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Table A3 Floating Point Instruction Summary (continued) 



Description 


Op-code 


Operand 


Convert Source to 
Specified Precision Fp 

Double to Single 
Fixed Point to Single 
Fixed Point to Double 
Single to Double 


cvt.s.d 
cvt.s.w 
cvt.d.w 
cvt.d.s 


destination,src2 


Double to Fixed Point 


cvt.w.d 




Single to Fixed Point 


cvt.w.s 




Negate Floating Point 






Double 


neg.d 




Single 


neg.s 
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Table A3 Floating Point Instruction Summary (continued) 



Description 



Compare Fp 

F Single 
F Double 

UN Single 
UN Double 

*EQ Single 
*EQ Double 

UEQ Single 
UEQ Double 

OLT Single 
OLT Double 

ULT Single 
ULT Double 

OLE Single 
OLE Double 

ULE Single 
ULE Double 

SF Single 
SF Double 

NGLE Single 
NGLE Double 

SEQ Single 
SEQ Double 

NGL Single 
NGL Double 



Op-code 



cXs 
cXd 

C.IU1.S 

c.un.d 

c.eq.s 
c.eq.d 

c.ueq.s 
c.ueq.d 

c*olts 
colt.d 

cults 
c.ultd 

c.ole.s 
cole.d 

c.ule.s 
cule.d 

c.sf.s 
c.sf.d 

c.ngle.s 
cjngle.d 

c.deq.s 
c.seq.d 

c.ngLs 
c.ngl.d 



Operand 



srcl,src2 



c 



c 



NOTE: Starred items (*) are the most common Compare instructions. The ma- 
chine has the other Compare instructions for IEEE compatibility. 
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Table A3 Floating Point Instruction Summary (continued) 



Description 


Op-code 


Operand 


Compare Fp 

*LT Single 
*LT Double 

NGE Single 
NGE Double 

*LE Single 
*LE Double 

NGT Single 
NGT Double 


cits 
c.lt.d 

c.nge.s 
c.nge.d 

c.Ie.s 
c.le.d 

cngt.s 
cngt.d 


srcl,src2 


Move Fp 

Single 
Double 


mov.s 
mov.d 


destination,srcl 



NOTE: Starred items (*) are the most common Compare instructions. The ma- 
chine has the other Compare instructions for IEEE compatibility. 
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B 
Basic Machine Definition 



The assembly language instructions described in this book are a superset of the 
actual machine instructions. Generally, the assembly language instructions 
match the machine instructions; however, in some cases the assembly language 
instruction are macros that generate more than one machine instruction (the as- 
sembly language multiplication instructions are examples). 

You can, in most instances, consider the assembly instructions as machine in- 
structions; however, for routines that require tight coding for performance rea- 
sons, you must be aware of the assembly instructions that generate more than one 
machine language instruction, as described in this appendix. 



Load and Store Instructions 



If you use an address as an operand in an assembler Load or Store instruction 
and the address references a data item that is not addressable through register $gp 
or the data item does not have an absolute address in the range -32768...32767, 
the assembler instruction generates a lui (load upper immediate) machine instruc- 
tion and generates the appropriate offset to $at. The assembler then uses $at as 
the index address for the reference. This condition occurs when the address has a 
relocatable external name offset (or index) from where the offset began. 

The assembler's la (load address) instruction generates an addiu (add unsigned 
immediate) machine instruction. If the address requires it, the la instruction also 
generates a lui (load upper immediate) machine instruction. The machine re- 
quires the la instruction because la couples relocatable information with the in- 
struction for symbolic addresses. 

Depending on the expression's value, the assembler's li (load immediate) instruc- 
tion can generate one or two machine instructions. For values in the 
-32768...6553S range or for values that have zeros as the 16 least significant bits, 
the li instruction generates a single machine instruction;, otherwise it generates 
two machine instructions.. 



Computational Instructions 



If a computational instruction immediate value falls outside the 0...65535 range 
for Logical ANDs, Logical ORs, or Logical XORs (exclusive or), the immediate 
field causes the machine to explicitly load a constant to a temporary register. 
Other instructions generate a single machine instruction when a value falls in the 
-32768„e.32767 range. 

The assembler's seq (set equal) and sne (set not equal) instructions generate 
three machine instructions each. 
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Special Instructions 



The assembler's break instruction packs the breakcode operand in unused regis- 
ter fields. An operating system convention determines the position. 



c 



Appendix B 

If one operand is a literal outside the range ~32768...32767, the assembler's sge 
(set greater than or equal to) and sle (set less/equal) instructions generate two ma- 
chine instructions each. 

The assembler's mulo and mulou (multiply) instructions generate machine in- 
structions to test for overflow and to move the result to a general register; if the 
destination register is $0, the check and move are not generated. 

The assembler's mul (multiply unsigned) instruction generates a machine in- 
struction to move the result to a general register; if the destination register is $0, 
the move and divide-by-zero checking is not generated. The assembler's divide 
instructions, div (divide with overflow) and divu (divide without overflow), gen- 
erate machine instructions to check for division by zero and to move the quotient 
into a general register; if the destination register is $0, the move is not generated. 

The assembler's rem (signed) and remu (unsigned) instructions also generate 
multiple instructions. 

The rotate instructions ror (rotate right) and rol (rotate left) generate three ma- 
chine instructions each. 

The abs (absolute value) instruction generates three machine instructions. 

Branch Instructions 

If the immediate value is not zero, the branch instructions beq (branch on equal) 
and bne (branch on not equal), each generate a load literal machine instruction. f 

The relational instructions generate a sit (set less than) machine instruction to ^ 

determine whether one register is less than or greater than another. Relational 
instructions can reorder the operands and branch on either zero or not zero as re- 
quired to do an operation. 

Coprocessor Instructions 

For symbolic addresses, the coprocessor interface Load and Store instructions, 
lcz (load coprocessor z) and scz (store coprocessor z) can generate a lui (load up- 
per immediate) machine instruction. 
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absolute instructions 
abs (absolute value), 5-9 
abs.d (add fp double), 6-6 
abs.s (add Fp single), 6-6 

add instructions 
add (with overflow), 5-9 
addu (without overflow), 5-9 

addressing 
descriptions, 2-2 
formats, 2-2 

archive files, 9-23 

auxilary symbols, 10-17 

auxiliary symbols, 10-4 



B 



branch and jump instructions, jal O'wnp and link), 5-18 

branch instructions, B-2 
b (branch), 5-16 
bal (branch and link), 5-16 
bczf (branch when coprocessor false), 5-20 
bczt (branch when z true), 5-20 
beq (begin on equal), 5-16 
beqz ( equal to zero), 5-16 
bge (greater or equal), 5-16 
bgeu (greater/equal unsigned), 5-16 
bgez (greater/equal zero), 5-16 
bgezal (greater/equal zero and link), 5-16 
bgt (on greater), 5-16 
bgtu (greater than unsigned), 5-16 
bgtz (greater than zero), 5-17 
ble (less or equal), 5-17 
bleu (less/equal unsigned), 5-17 
blez (less/equal zero), 5-17 
bit (branch on less), 5-17 
bltu (less than unsigned), 5-17 
bltz (less than zero), 5-17 
bltzal (less than zero and link), 5-17 
bne (not equal), 5-17 
bnez (not equal to zero), 5-17 
filling delay slot, 5-1 

break, 5-19 



C 

C programs, calling, 7-1 



Computational Instructions, description of, 5-7 

calling programs in other languages, 7-1 

cfcz (control from processor z), 5-20 

comments, 4-1 

compare filing point instructions, c.sf.d (SF double), 
6-9,6-11 

compare floating point instructions 
c.eq.d (EQ double), 6-9, 6-10 
c.eq.s (EQ single), 6-9, 6-10 
c.f.d(f double), 6-9, 6-10 
c.f.s(f single), 6-9, 6-10 
c.le.d (LE double), 6-10 
c.le.s (LE single), 6-10 
c.ltd (LT double), 6-10, 6-1 1 
c.lt.s (LT single), 6-10, 6-11 
c.nge.s (NGE single), 6-10, 6-1 1 
c.ngl.d (NGL double), 6-10, 6-1 1 
c.ngl.s (NGL single), 6-10, 6-11 
c.ngle.d (NGLE double), 6-10, 6-1 1 
c.ngle.s (NGLE single), 6-10, 6-11 
c.ngt.d (NGT double), 6-10, 6-11 
c.ngt.s (NGT single), 6-10, 6-11 
c.ole.d(OLE double), 6-9, 6-11 
c.ole.s (OLE single), 6-9, 6-1 1 
c.olt.d (OLT double), 6-9,6-11 
c.olt.s (OLT single), 6-9, 6-1 1 
c.seq.d (SEQ double), 6-10, 6-1 1 
c.seq.s (SEQ single), 6-10, 6-11 
c.sf.s(SF single), 6-9, 6-11 
c.ueq.d (UEQ double), 6-9, 6-12 
c.ueq.s ( UEQ single), 6-9, 6-12 
c.ule.s (ULE single), 6-9, 6-12 
c.ult.d (ULT double), 6-9, 6-12 
cults (ULT single), 6-9, 6-12 
c.um.s (UN single), 6-9, 6-12 
c.un.d (UN double), 6-9, 6-12 

comparing floating point instructions, c.ule.d (ULE 
double), 6-9, 6-12 

computational instructions, B-l 
formats, 5-7 

constants 
floating point, 4-2 
scalar, 4-2 
string, 4-3 

convert instructions 
cvtd.s (source single to double), 6-4 
cvt.d.w (source fixed point to double), 6-4 
cvt.s.d (source single to double), 6-4 
cvt.w.d (source double to fixed point), 6-4 
cvt.w.s (souce single to fixed), 6-4 

coprocessor instruction set, 6-1 
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coprocessor instructions, B-2 

coprocessor interface instructions 
description of, 5-20 
formats, 5-19 

counters, 4-4 

ctcz (control to coprocessor z), 5-20 

cycles per instruction, 5-1 

cz ( coprocessor z operation), 5-20 

D 

data types, 4-7 

dense numbers, 10-3 

divide instructions 
div (signed), 5-9 

div.d (divide Fp double), 6-4, 6-6 
div.s (divide Fp single), 6-4, 6-6 
divu (unsigned), 5-10 



exceptions 
data types, 4-7 
floating point, 3-1 
main processor, 3-1 
operators, 4-7 
precedence, 4-7 
type propagation, 4-9 

external strings, 10-5 

external symbols, 10-6, 10-20 



file descriptor table, 10-6, 10-20 

file header 
file header magic field (fjnagic), 9-6 
flags (s_flags), 9-8 

floating point 
constants, 4-2 
formats, 6-4 
instructional descriptions, 6-5 

floating point relational operations 
formats, 6-9 

instruction descriptions, 6-10 
move formats, 6-12 
move instruction descriptions, 6-12 



format of symbol table entries, 10-7 

G 

gp (global pointer) tables, 9-9 

I 

identifiers, 4-2 
instruction notation, 6-1 
instruction pipelines, 5-1 
instruction summaries, A-l 
instructions, constraints and rules, 5-1 



jump and branch formats, formats, 5-15 
jump and branch instructions, j (jump), 5-18 
jump instructions, delayed, 5-1 



( 



K 

keyword statements, 4-6 



Load Instructions, descriptions of, 5-3 

label definitions, 4-6 

language interfaces, 7-1 

li (floating point immediate), 6-3 

line numbers, in symbol table, 10-3, 10-9 

link editor defined symbols, 9-24 

linkage conventions 
examples, 7-7 
general, 7-2 

language interfaces, 7-1 1 
memory allocation, 7-15 

load and store instructions, B-l 
descriptions, 6-3 
formats, 6-3 

load instructions 
delayed, 5-1 
formats, 5-3 
la (load address), 5-4 
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lb (load byte), 5-4 

lbu (load byte unasigned), 5-4 

Id (load double), 5-4 

Ih (load halfword), 5-4 

lhu (load halfword unsigned), 5-4 

li (load immediate), 5-4 

lui (load upper immediate), 5-4 

lw (word), 5-5 

lwcz (word coprocessor z), 5-21 

Iwl (load word left), 5-5 

Iwr (load word right), 5-5 

ulh (unaligned load halfword), 5-5 

ulha (load unsigned halfword), 5-5 

ulw (unaligned load word), 5-5 

local strings, 10-4 

local symbols, 10-4 



M 



move instructions 
mfcz (from coprocessor), 5-21 
mfhi (from HI register), 5-19 
mflo (from LO register), 5-19 
mov.d (Fp double), 6-12 
mov.s (Fp single), 6-12 
move (MOVE), 5-10 
mtcz (to coprocessor), 5-21 
mthi (to HI register), 5-19 
mtlo (to LO register), 5-19 

multiply instructions 
mul (Multiply 32-bit product), 5-10 
mul.d (multiply double), 6-6 
mul.s (multiply single), 6-6 
mulo (multiply with overflow), 5-11 
mulou (multiply with overflow unsigned), 5-11 
mult (Multiply 64-bit product), 5-10 
multu (multiply unsigned), 5-10 



N 

NMAGIC files, 9-6 

negate instructions 
neg (with overflow), 5-11 
neg.d (floating point double), 6-7 
neg.s (floating point single), 6-7 
negu (without overflow), 5-1 1 

nor (NOTOR), 5-11 

not (NOT), 5-11 

null statements, 4-6 



OMAGIC files, 9-18 

object file format, 9-1, 9-2 

object file types 
demand paged (ZMAGIC) files, 9-20 
impure format (OMAGIC) files, 9-18 
shared text (NMAGIC) files, 9-19 

object files types, target shared library (LIBMAGIC) 
files, 9-22 

object files, loading 
boundary constraints, 9-17 
description, 9-23 

operators, 4-7 

optimization symbols, 10-4 

optional header, 9-5 

optional header magic field (magic), 9-6 

or (OR), 5-11 



Pascal progams, calling, 7-1 
performance, maximizing, 5-1 
pipelines, instruction, 5-1 
precedence, 4-7 

procedure descriptor table, 10-4, 10-12 
program design, 7-1 



register and control status, exception trap processing, 
6-15 

register control status, floating point rounding, 6-18 

register use and linkage, 7-2 

registers 
floating point, 1-5 
format, on big endian, 1-1 
format, on little endian, 1-2 
general, 1-3, 1-5 

relative file descriptor, 10-6 

rem (remainder signed), 5-12 

remu (remainder unsigned), 5-12 

rfe (restore from exception), 5-19 

rol (rotate left), 5-12 
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ror (rotate right), 5-12 
round.w.s (round), 6-6 
roundd.w.d (round), 6-6 



s.s (store Fp single), 6-3 

scalar constants, 4-2 

section data, 9-7, 9-10 

section headers 
flags(s Jlags), 9-8 
section name (sjname), 9-7 

section relocation information 
assembler and link editor processing, 9-12 
relocation entry, 9-1 1 
relocation table entry, 9-13 

set instructions 
seq (set equal), 5-12 
sge (greater/equal), 5-13 
sgeu (greater/equal unsigned), 5-13 
sgt (set greater), 5-13 
sgtu (greater unsigned), 5-13 
sle (less/equal), 5-13 
sleu (less/equal unsigned), 5-13 
sit (less), 5-13 
situ (less unsigned), 5-13 
sne (set not equal), 5-14 

shared libraries, in objects, 9-9, 9-22 

shift instructions 
all (shift left logical), 5-14 
sra (shift right arithmetic), 5-14 
srl (shift right logical), 5-14 

special instructions, B-2 
descriptions of, 5-19 
formats, 5-18 

stack frame, 7-4 

statements, 4-6 

status register, 6-13 



store instructions 
description of, 5-6 
formats, 5-3 

s.d (store Fp double), 6-3 
sb (store byte), 5-6 
sd (store double), 5-6 
sh (store halfword), 5-6 
sw (store word), 5-6 
swcz (store word coprocessor z), 5-21 
swl (store word left), 5-6 
swr (store word right), 5-7 
ush (unaligned store halfword), 5-7 
usw (unaligned store word), 5-7 

string constants, 4-3 

subtract instructions 
sub (with overflow), 5-14 
sub.d (Fp double), 6-4, 6-7 
sub.s (Fp single), 6-4, 6-7 
subu (without overflow), 5-14 

symbol table, 10-1 

symbolic header, 10-3, 10-7 

syscall, 5-19 

system control coprocessor instructions 
descriptions, 6-13 
formats, 6-13 
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tokens, 4-1 

type propagation, 4-9 



xor (EXCLUSIVE OR), 5-10 



ZMAGIC files, 9-20 
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Customer Response Card 

Your comments, which can assist us in improving our products and our 
publications, are welcome. 

If you wish to reply, be sure to include your name and address, and the name 
and part number that appears on the first page of this manual. 

Thank you for your cooperation. 

No postage necessary if mailed in the U. S. A. 

After writing comments, detach this page and then fold, seal, and mail. 
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Name of manual: 
Part number: 



MIPS may use and distribute any of the information you supply in any way it 
believes appropriate without incurring any obligation whatever. You may. of 
course, continue to use the information you supply. 
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